FRAMES OF REFERENCE IN PERSONALITY ASSESSMENT
WILLIAM F. SOSKIN¹
University of Chicago 


INTRODUCTION 


The vast increase during the past half-decade in researches directed toward the 
validation of currently popular “diagnostic” devices requires that repeatedly the 
question be asked, “What is being validated?” in these researches. The report of the
APA Committee on Test Standards“) points out that “validity is not an absolute 
characteristic of a test”, and proceeds to distinguish four categories of validity, viz., 
predictive validity, status validity, content validity, and congruent validity, to 
which the Committee on Diagnostic Devices of the Division of Clinical and Ab- 
normal Psychology, taking cognizance of some unpublished work by the present 
writer and others, proposed a fifth type, “postdictive” validity @?.

Even these distinctions fail to emphasize adequately that in many instances it 
is not the test which is being validated, but rather the judging skill of a human opera- 
tor. Where some quantifiable performance record is directly correlated with a criter- 
ion, it may reasonably be said that the test is being validated; but when the per- 
formance record serves merely as a basis for inferences which are rendered quantita- 
tive according to the experience or bias of a human observer, it is clearly some pro- 
duct of the interaction between performance record and observer experience which 
is being validated. Only by entertaining the seemingly untenable assumption that 
no significant differences exist among judges in their ability to transmute perform- 
ance records into quantitatively expressed judgments is it possible to claim that the 
correspondence between test-based judgments and some suitable criterion is an index 
of test validity per se. 

Factors which might influence the transmuting characteristics of the judge 
become, under these circumstances, critical determinants of the validity of judg- 
ments. The relationship of judge to subject, type of performance record on which 
the judgment is based, purpose for which the judgment is made, etc., may significant- 
ly alter the frame of reference in which judgments are rendered quantitative. An 
opportunity to observe such shifts in frame of reference presented itself in the course 
of the analysis of various types of ratings collected in the study on the assessment of 
applicants for graduate training in clinical psychology, sponsored by the Veterans 
Administration. *. 

In that study more than 100 applicants for admission to graduate programs in 
clinical psychology at a number of universities were brought to an assessment center 
in small groups throughout one summer for intensive study by a staff of psychologists 
and psychiatrists. In the course of his five-day stay, each candidate was studied
intensively by a number of staff members who sometimes worked as individuals,
sometimes as teams, sometimes in direct contact as in interviews and in role-playing
situations, sometimes only through the medium of test protocols, handwritten
autobiographies, etc., in “blind” diagnoses. Staff members were called on to make judgments
about each candidate at several specified stages throughout the assessment program 
in order that data might be analyzed to learn the contribution of each type of pro- 
cedure to the final appraisal of the candidate. Judgments were made with respect to 
some 40 dimensions of personality and were expressed on a rating scale suitable for 
each dimension. 

¹The author wishes to express his appreciation to Drs. E. L. Kelly and D. W. Fiske for making
available the data used in this study.

From these data it was possible to make comparisons between the types of
judgments expressed by staff members working with different types of instruments or
procedures. Five types of judgments were examined with respect to 31 of the 40
rating scales used in the assessment. The remaining nine dimensions referred to
job-performance behavior and were omitted from this comparison. The types of
judgments of particular interest are the following:


Projective Integration. Hereafter designated Proj-I, these ratings were made by in- 
dividuals considered specialists with the Rorschach and TAT and other projective 
instruments. Of the six teams of judges at work each week, three teams included a 
projective integrator as one member of the three-man unit. This person usually ad- 
ministered the Rorschach personally and, in addition, had available to him for an- 
alysis a TAT protocol, a projective-test version of the Bender-Gestalt test, and a 
sentence completion test. The latter three tests were independently interpreted by 
other psychologists assigned to this work, and their interpretations were available 
to the projective test integrator, who was free to use their interpretations or— as 
most of them preferred—to make his own analyses in the course of integrating the 
projective material. On the basis of his evaluation of the entire battery of projective 
tests the Proj-I ratings were made independently without benefit of discussion with 
other staff members. 


Preconference Ratings. Hereafter referred to as Precon-I, these ratings were made by 
one member from each of three teams on the basis of all the projective tests, plus a 
battery of objective tests, plus an autobiography written by the subject at the
assessment center, plus such biographical data as college credentials, letters of
recommendation, etc. Unlike the Proj-I rater, the Precon-I staff member had no direct contact
with the subject before making his ratings, nor was he provided with the projective 
test interpretations available to the Proj-I rater. All materials available to him had 
been collected by other staff members. Like his Proj-I counterpart, each Precon-I 
rater served as the third member of a regular staff team, so that of the six teams at 
work in any one week, half had Proj-I raters assigned to them and the other half 
each included a Precon-I rater among its members. 


Preliminary Pooled Ratings. Hereafter referred to as Pre-P, this set of ratings was 
arrived at by the team of three judges after joint discussion of preliminary evaluation 
of all available data. At this point in the assessment schedule each team had avail- 
able to it, in addition to the materials studied by either its Proj-I or Precon-I raters, 
the data initially available to a second member of the team, viz., observations and 
information gathered in a one-hour interview with the subject, plus clues gained from 
study of the subject’s college transcript, letters of recommendation and a com- 
pleted Civil Service Form 57. The third member of the team, meanwhile, had access 
to all the tests and documents available to the other two members, and, in addition, 
had conducted a two-hour interview with the subject. All these data and opinions 
were shared and discussed in conference, after which the staff team, as a team, ar- 
rived at a new set of ratings, the Pre-P ratings. 


Final Pooled Ratings. Hereafter referred to as Fin-P ratings, this set of judgments 
was again a team product, determined by the team at its final conference, usually 
two days after the Pre-P conference. In the interim the team observed its subjects 
in a number of standard situations and tests and had had an opportunity to study 
ratings of the subject made by still other staff members, e.g. the men who rated in- 
dividual projective tests, certain observers who saw the subject in role-playing situa- 
tions but knew nothing else about him, etc. Also, one of the team members conducted 
a final brief interview with the subject before the latter was dismissed from the 
assessment center. Hence all available information was used by the team in de- 
termining its Fin-P ratings. 


Situations Ratings, Pooled. Hereafter referred to as Sit-P ratings, these judgments 
were based on observations of subject behavior in a variety of “situations tests”
staged late in the assessment schedule. Some of the situations involved individual 
performances; some were two-person interactions and some were group tasks. When 
this phase of assessment was reached, teams of judges and subjects were paired so 
that judges from teams A and B simultaneously observed student groups A and B. 
Thus, team A judges, who had previously studied group A subjects for several days
through interview, test batteries, etc., now observed their own group A subjects in 
the situations test, and, in addition, observed group B subjects with whom they had 
no previous familiarity. Similarly, team B judges observed group B subjects whom 
they had previously studied and group A subjects with whom they had no previous 
acquaintance. Hence, team A’s judgments about group B subjects and team B’s 
judgments about group A subjects provide data for studying ratings based solely on 
performance in situation tests. 


Peer Ratings, Median. Hereafter referred to as Peer-M ratings, these judgments were 
made by the subjects about each other. Each member of the four-man subject group 
was asked to make ratings on each of his team-mates, based on observations and 
impressions gained during the five-day assessment period. Since subjects not only 
performed together, but also lived together in a dormitory and sought recreation to- 
gether during their limited free time, there were numerous opportunities for intimate 
unsupervised contact between subjects. Subjects were asked to make the same types 
of ratings as those made by staff judges; the median of the three team-mate
ratings on each variable is used in this comparison. 


POPULATIONS 


The results to be reported here are based on the appraisals of the total male 
population of subjects studied in the assessment, all of them beginning graduate 
students recently accepted by some university for graduate study in clinical
psychology. The N varies for different types of ratings. For Fin-P, Pre-P, Sit-P and
Peer-M ratings N = 128, the total male sample; for Proj-I and Precon-I the N is ap- 
proximately half that size since, it will be recalled, Proj-I raters served on only half 
the judging teams and Precon-I raters on the other half. Because of slight changes in 
scheduling, the total number of subjects rated by Proj-I raters was 67, for Precon-I 
raters, 61. 

The majority of staff members were faculty members of university psychology 
departments or members of hospital staffs. Two were psychiatrists. Several were ad- 
vanced graduate students with considerable experience in psychodiagnostics. Staff 
members were rotated from one judging team to another throughout the summer, so 
that Pre-P, Sit-P and Fin-P ratings were made by three-man teams varying in
composition each week; hence, any differences due to individual competencies or
shortcomings were presumably randomized. By contrast, the major portion of Precon-I
ratings were made by the same three staff members, who served in this capacity 
throughout most of the summer. Proj-I ratings were made by six different staff 
members, four of whom served with three classes each, while one served with four 
and another with only two classes. 


RESULTS 


The comparisons to be made in this paper are those between the means of the 
six types of ratings on each of the 31 personality dimensions used for assessment 
purposes. 

For the majority of the 31 scales it was easily possible to identify a favorable and 
unfavorable pole on the continuum. In a few instances, e.g. scale 6, Gregarious vs.
Non-gregarious, or scale 19, Marked vs. Slight Overt Interest in Opposite Sex, the 
favorable pole is not easily distinguished, since extremes in either direction on such 
continua may be presumed to indicate maladjustment. In these several instances the 
labelling is somewhat arbitrary, but does not appreciably affect the major trend to 
be demonstrated in Figure 1. 

The reference population was defined for staff members as “first-year clinical 
psychology graduate students at universities accredited by the APA to offer training 
in clinical psychology,” and since the subjects already had been accepted for grad- 
uate study by one or another of these universities, and the total population of sub- 
jects studied represented a large proportion of all students accepted for graduate 
study in clinical psychology under the V. A. program that year, it may safely be
assumed that the sample studied was a representative sample of the specified refer- 
ence population. Accordingly, it was to be anticipated that the means of ratings on 
the 31 scales, each being set up as a bipolar continuum with the extremes ranging
from one to eight, would cluster around 4.5.

The mean of each of the six types of ratings was computed for each of the 31 
scales, and it is the plots of these means which are portrayed in Figures 1-3. Along
the abscissa in each figure the scales are arranged in terms of magnitude and direction 
of the deviation of the Fin-P mean—ranging from the most extreme deviation in the 
direction of the unfavorable pole on the left to extreme deviation in the direction of 
the favorable pole on the right. Fin-P means were selected as the base for compara- 
tive purposes since these were group judgments derived from the maximum amount 
of data available about subjects in this study. 
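
The ordering of scales along the abscissa can be sketched in a few lines of Python. This is an illustration only: the scale names, the means and the favorable-pole flags below are hypothetical and are not data from the study. The midpoint of 4.5 follows from the one-to-eight range of the scales.

```python
# Hypothetical Fin-P means for four scales (illustrative values, not study data).
# Each entry maps a scale name to (mean rating, True if the high end is the favorable pole).
SCALE_MIDPOINT = (1 + 8) / 2  # expected mean on a one-to-eight scale

fin_p = {
    "scale_a": (3.6, True),
    "scale_b": (5.4, True),
    "scale_c": (5.1, False),
    "scale_d": (4.4, True),
}


def signed_deviation(mean, high_is_favorable):
    """Deviation from the midpoint, counted positive toward the favorable pole."""
    deviation = mean - SCALE_MIDPOINT
    return deviation if high_is_favorable else -deviation


def order_scales(means):
    """Scale names sorted from the most unfavorable to the most favorable deviation."""
    return sorted(means, key=lambda name: signed_deviation(*means[name]))


ordered = order_scales(fin_p)
```

With scales arranged in this way, the profile of any other type of rating can be plotted against the same left-to-right order for comparison with the Fin-P base.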

Several features of the figures are immediately apparent. The means for various 
scales show wide divergences from the hypothetical mean of 4.5; the profiles of means
for all types of staff ratings, regardless of the type of information on which they are 
based, show the same general contour; and the profile of means of student ratings 
shows a markedly different contour from that of staff ratings. 

Figure 1 presents the profiles of means for Fin-P, Proj-I and Precon-I ratings. 
The gross agreement in configuration of these three profiles, as well as those of
Figure 2, suggests that the staff as a whole shared a more or less common definition of
the several scales. From the direction of deviations away from the hypothetical 
mean of 4.5, it might be inferred that the staff as a whole—regardless of the type of 
information on which individuals’ judgments were based—appeared to view this 
population, according to the terms used in defining the scales, as markedly
status-driven, possessed of higher inner tension, exhibiting poor sexual adjustment,
somewhat selfish, uninsightful, cautious, secretive, suspicious, rigid, anxious, etc. On the
more positive side this population of subjects appeared to the judges as very serious
in purpose, quite independent-minded, consistent, conscientious, cooperative,
strongly motivated for scientific work, socially poised, assertive, gregarious, etc.
With respect to a majority of the dimensions studied, the staff viewed this group less
favorably than it assumed the reference population to be, while in other respects
it characterized this group as superior to the reference population.


Fig. 1. Comparison of the means of ratings
assigned by judges using projective tests only
(Proj-I) and by judges doing “blind” analyses
based on test performance and other materials
(Precon-I) with the means of ratings assigned
by the criterion team (Fin-P). The rating scale
variables are described in“),


From the distribution of means in Figure 1, it appears that individual raters’ 
judgments based on projective test protocols alone, or those derived from blind an- 
alysis of projective and objective tests plus other personal documents, tend to be 
quite consistently less favorable than the consensus of a three-man team based on 
the maximum amount and most diversified types of information available in this
assessment. 

By comparing the Proj-I and Precon-I profiles of Figure 1 and the Pre-P profile 
of Figure 2 with respect to the Fin-P profile, it is apparent that the inclusion of
interviews as one factor and the pooling conference as another combine to effect a
substantial change in the frame of reference. By comparison with Proj-I and Precon-I,
Pre-P judgments tend to be somewhat less unfavorable where the staff’s frame of 
reference conduces toward unfavorable judgments and somewhat more favorable 
where the frame of reference conduces toward favorable appraisals. This is not a re- 
gression toward the mean, then, but rather reflects a general disposition on the part 
of Pre-P teams to make more favorable appraisals than did individuals working 
alone using only projective tests or a combination of objective and projective tests 
and other personal documents. 

By and large, among staff ratings Fin-P means show the smallest deviation in 
the direction of unfavorable poles when the staff’s frame of reference conduces toward 
judgments in that direction and, with Pre-P means, show the greatest deviations
in the direction of the favorable poles on scales where the staff’s frame of reference 
conduces toward favorable appraisals. Proj-I and Precon-I means tend in the major- 
ity of scales to deviate farthest in the direction of unfavorable poles and least in the 
direction of favorable poles, with Proj-I mean deviations being the most extreme 
more than twice as often as are those of the Precon-I raters. This relationship is 
emphasized in Table 1. To test the significance of the consistency of direction of 
deviation among the several types of ratings, the sign test® was used to compare 
the arrays of means of various types of ratings. 


TABLE 1. SIGNIFICANCE OF DIFFERENCES IN DIRECTION OF DEVIATION
BETWEEN MEANS OF DIFFERENT TYPES OF RATINGS AS DETERMINED BY
THE SIGN TEST

Types of
Ratings     Fin-P   Pre-P   Sit-P   Proj-I   Precon-I

Pre-P        --
Sit-P       .01      --
Proj-I      .01     .01      --
Precon-I    .01     .05      --       --
Peer-M      .01     .01     .01      .01       .01


The following relationships are found: both Precon-I and Proj-I means deviate 
a significant number of times from Pre-P and Fin-P means in the direction of the less 
favorable pole, and Proj-I means also deviate from Precon-I means in the same
direction a significant number of times. There is no significant difference in direction 
of shift between Pre-P and Fin-P means. 
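
For readers unfamiliar with the sign test, the comparison of two arrays of scale means may be sketched as follows. This is a minimal illustration and not the computation actually performed in the study: the function name and the sample values are hypothetical, the means are assumed to be oriented so that a higher value is more favorable, ties are dropped, and a two-sided exact binomial probability is used, although the text does not say whether a one-sided or a two-sided test was applied.

```python
from math import comb


def sign_test(means_x, means_y):
    """Exact two-sided sign test on paired scale means.

    Returns (count where x > y, count where x < y, p-value). Ties are ignored.
    """
    plus = sum(1 for x, y in zip(means_x, means_y) if x > y)
    minus = sum(1 for x, y in zip(means_x, means_y) if x < y)
    n = plus + minus
    if n == 0:
        return plus, minus, 1.0
    k = min(plus, minus)
    # Probability of k or fewer of the rarer sign when both signs are equally likely.
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return plus, minus, min(1.0, 2 * tail)


# Hypothetical means on ten scales: x exceeds y on nine scales and falls below on one.
x_means = [5, 5, 5, 5, 5, 5, 5, 5, 5, 3]
y_means = [4] * 10
plus, minus, p_value = sign_test(x_means, y_means)
```

In this hypothetical case nine of the ten differences share one sign, giving a two-sided probability of 22/1024, or about .02.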

Figure 2 compares the profiles of Pre-P and Fin-P means with that obtained 
from Sit-P ratings. While it is immediately apparent that the Sit-P profile occupies 
the same relative position with respect to the Fin-P profile as those of Proj-I and 
Precon-I means, the Sit-P profile is presented in a separate figure in order to em- 
phasize a different type of comparison. Whereas the profiles in Figure 1 compare 
two types of individual ratings based on tests and/or personal document data with
a team consensus rating based on all available information, in Figure 2 a comparison 
is made between various types of consensus ratings based on somewhat different 
data and involving a different judge-subject relationship. 


Fig. 2. Comparison of means of ratings 
assigned by the criterion team prior to its ob- 
servation of situational tests (Pre-P) and the 
ratings assigned by judges who saw subjects 
only in the situational tests (Sit-P) with the 
final criterion ratings (Fin-P). 
It is apparent from Figure 2 that the information available to Pre-P teams
following the stage of Pre-P judging and up to the time of Fin-P judging exerted no 
consistent influence on the frame of reference of the judging teams. Means shifted 
in a more favorable direction on some scales and in a less favorable direction on 
others. That is to say, judging teams who added this information to an already large 
fund of information about subjects they had been studying for several days showed 
variable shifts in frame of reference as a consequence of exposure to performances 
of subjects on Situations Test Day. By comparison, teams who based their judg- 
ment solely on observations made on Situations Test Day exhibit a frame of refer- 
ence significantly less favorable than those reflected by either Pre-P or Fin-P ratings. 


Available data are not as satisfactory as one might wish for interpreting this 
phenomenon. In the interim between Pre-P and Fin-P judging, persons who made 
these ratings had several different types of information available to them. The 
major emphasis, of course, was on the situations tests, since one entire day was de- 
voted to such tasks and judges had an opportunity to observe each subject over a 
period of four to six or more hours. In addition, however, following Situations Test 
Day and immediately preceding the making of Fin-P ratings, these judges were 
permitted to examine all the ratings of a given subject made by various staff mem- 
bers, as well as those made by the three other members of the subject’s peer group. 
Such data were not available to judging teams who made the Sit-P ratings, since 
these men were required to base their appraisals solely on the observations of situa- 
tions test performance. 


It is the opinion of the writer, however, that the significantly less favorable 
frame of reference of the Sit-P raters was a function of the relationship of these raters 
to the subjects they were judging. The opinion is based on two types of evidence. 
When pairs of judging and subject teams worked together on Situations Test Day, it 
was evident that team A judges tended to think of group A subjects (the subjects
whom they had studied for several previous days) as “our boys” and to refer to
group B subjects (those with whom they had no previous familiarity) as “their
boys”, thus associating the latter with team B judges. One had the impression that
as a consequence of prior access to intimate personal information about the subject, 
relationships established between judge and subject during interviews, the tendency 
of teams to become more “understanding” of their subjects the more judges shared
opinions in the pooling conferences, etc., judges were rather benign in appraising their
own charges in situations tests while investing another team’s subjects with a
“negative halo”. Reinforcing this view is evidence from a subsequent unpublished study
indicating that when relationship between judge and subject is held constant, ob- 
servers of situations test performance make more favorable appraisals than do judges 
using only projective test materials. 

Figure 3 compares the profile of means of peer ratings with the profile of means 
of Fin-P ratings. The direction of deviation is significant beyond the one percent 
level according to the sign test.


Fig. 3. Comparison of means of ratings by
peers (Peer-M) with means of ratings assigned
by the criterion team (Fin-P).


It is apparent that peers entertained an entirely
different frame of reference for rating purposes than did the assessment staff, since
the profiles of means of all the types of staff ratings depicted in Figures 1 and 2 ex- 
hibit the same general contour, which is markedly different from the contour of 
means of peer ratings. Most striking is the fact that peers tended to rate each other 
extremely favorably on the majority of scales, the only notable exception being the 
scale having to do with motivation for professional status. The staff tended to inter- 
pret this scale as referring to desire for rank, prestige, authority per se, hence in this 
scale the high end of the scale is classed as the unfavorable pole. Subjects apparently 
interpreted the scale differently. 


The disparity between these two profiles raises several interesting questions, 
which unfortunately cannot be answered from available data. It seems highly un- 
likely that the staff entertained a realistic impression of the hypothetical reference 
population specified in the rating scales, which was “first-year clinical psychology
graduate students at universities accredited by the APA to offer training in clinical 
psychology.” In view of the year in which the study was conducted, it does not seem 
improbable that the staff was actually assessing a substantial if not major segment 
of the specified reference population. Hence the frame of reference of the staff must 
have been based on a population it would have liked to see beginning graduate work 
in clinical psychology, or its frame of reference was that of the hypothetical well- 
adjusted young adult, or what seems equally probable, subjects were evaluated on 
the various scales in terms of staff members’ self-appraisals, since it must not be
forgotten that these staff members were evaluating candidates for their own profession.


By contrast, the subjects appear to have viewed each other as representing 
more or less the “cream” of such a hypothetical group as the specified reference
population, or, in consequence of personal misgivings growing out of impressions of their
own performance throughout the assessment, tended to overestimate the favorable 
capacities or characteristics of their peers. Still another possibility is that peers were 
less objective in their appraisals than might have been desired, using the rating
scales as a means of giving each other a vote of confidence and thereby expressing a 
feeling of group solidarity. 


It is apparent from the foregoing that human operators, given a standard refer- 
ence population as a basis for making quantitative judgments about subjects, marked- 
ly differed in their descriptions of the experimental population. These differences ap- 
pear to be functions of such factors as the type of information with which judges are 
provided, the relationship between judge and subject, the purpose for which
judgments are being made, etc. If the judge is conceived as merely a transmuter of
non-quantitative data into quantitative form, we are obliged to recognize that the
transmuter operates differently under different conditions, the “readings” he registers
being substantially influenced by factors other than the performance characteristics 
of the subject. This source of variance is often assigned relatively minor importance 
in researches designed to “validate” non-objective-type diagnostic procedures and
devices where it is not the instrument which is being validated but the inference- 
making characteristics of a variety of human observers based on their interpretations 
of the performance record. 


SUMMARY 


Attention is called to the interaction of datum and observer in certain types of 
testing or measurement procedures. When judgments are the measures used in 
validation studies, it is the product of the datum-observer interaction which is being 
validated and not the instrument by which the subject’s performance is elicited. 


Means of various types of ratings are compared to show how factors operating 
on the observer can induce different frames of reference for the formation of judg- 
ments. 





INTRODUCTION


The purpose of this paper is to describe a rapid procedure for identifying pat- 
terns and to present the findings from an application of the method to data descrip- 
tive of 423 psychotic patients. The procedure to be described, called type-tracking, 
assumes that in a population studied, there may be found relatively stable subgroups 
or classes of persons each having in common a distinctive pattern of traits. Such 
groupings in a patient population have been the historical basis for the development 
of diagnostic classifications, although these classifications now often depend in part 
upon the recognition of specific pathogenic agents. 

The configuration of characteristics held in common by a distinguishable group 
of persons is here designated a pattern. The group of individuals exhibiting a dis- 
tinctive pattern represents a type. A pattern may be contrasted with a factor. The 
latter is defined by a group of traits all of which are intercorrelated. For a group of 
psychiatric patients, a pattern consists of a configuration of symptoms and com- 
plaints, such as those seen in the agitated depression. A factor, on the other hand, 
is a general morbid tendency such as depressive tendency or agitation which may cut 
across or be represented in more than one pattern, as depression is also represented 
in the retarded depression, and agitation is also represented in manic and in schizo- 
phrenic excitements. 

A profile or pattern descriptive of a person consists of a set of scores. Let $X_{j1}$
represent the score of person 1 on test j. If there are K variates (a, b, c, ..., j, k),
then we may regard $X_{a1}, X_{b1}, X_{c1}, \ldots, X_{k1}$, the set of scores for person 1, as the
coordinates of a point $P_1$ in K-dimensional space. When the variates are independent,
they may be represented by orthogonal axes; correlated variates may be
represented by oblique axes.

A population may thus be represented geometrically as N points in a K-dimen- 
sional scatter diagram, in which N is the number of persons and K is the number of 
scales. The problem is one of locating in such a scatter diagram the intervals in 
which the number of cases present exceeds the number which might be expected if 
the scores were unrelated. If the number of cases falling into an interval is signifi- 
cantly greater than expectancy, it would indicate a clustering in the diagram. The 
actual procedure followed will be illustrated at further stages in the description. 
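
The comparison of observed with expected frequencies may be sketched as below. The sketch rests on an assumption not stated in the text: that the expectancy for a cell is N times the product of the marginal proportions of its intervals, which is the frequency to be expected if the scores were unrelated. The function name and the sample profiles are hypothetical, and no test of significance is included.

```python
from collections import Counter
from math import prod


def cell_excess(profiles):
    """Observed and expected counts for every occupied cell of the scatter diagram.

    profiles: list of tuples, one per person, giving the interval occupied on each of K scales.
    Returns {cell: (observed, expected)}, with expected computed as if the scales were unrelated.
    """
    n = len(profiles)
    k = len(profiles[0])
    # Marginal frequency of each interval, taken one scale at a time.
    marginals = [Counter(p[j] for p in profiles) for j in range(k)]
    observed = Counter(profiles)
    return {
        cell: (count, n * prod(marginals[j][cell[j]] / n for j in range(k)))
        for cell, count in observed.items()
    }


# Ten hypothetical persons rated on two scales of two intervals each.
sample = [(1, 1)] * 4 + [(2, 2)] * 4 + [(1, 2), (2, 1)]
cells = cell_excess(sample)
```

In this hypothetical sample the cell (1, 1) holds four cases where 2.5 would be expected, which is the kind of excess the procedure is meant to locate.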


PROCEDURES 


Data for the study were secured by means of the Northport Record for the
Description of Psychiatric Patients, a rating schedule which consists of 81 brief scales.
One section consists of 46 scales which can be completed in 10 minutes following a
relatively thorough psychiatric interview of the patient. The second section in- 
cludes 35 scales descriptive of the patient’s behavior on the ward. It is completed 
on the basis of information secured by interviewing the nurse or psychiatric aide who 
knows the patient well. Only 54 of the scales were used in this study. 

The sample consisted of 423 hospitalized male veteran psychotics from four 
hospitals. Ratings were undertaken by staff members of these hospitals. The largest 
group consisted of approximately 225 patients, representing a 10 percent sample of 
the VA Hospital at Northport, Long Island, New York. Efforts were made to in- 
clude representation of all types of patients such as disturbed, grossly deteriorated, 
chronically sick, and those assigned to an open ward. Cases were added from VA 
hospitals at Roanoke, Virginia; Coatesville, Pennsylvania; Lyons, New Jersey; and 
Lexington, Kentucky. 


METHOD OF ANALYSIS


Our method of analysis was to begin with known syndromes or factors. For 
example, previous work had revealed that ideas of reference, delusions of influence, 
ideas of persecution and hypochondriacal delusions are associated and may be re- 
lated to a paranoid factor. This factor was used as a starting point. A pattern was 
accordingly made up of scale values 3 and 4 on our scale for ideas of reference, scale 
values 2, 3 and 4 on our scale for delusions of influence, scale values 3 and 4 on our 
scale for ideas of persecution and scale values 2, 3 and 4 on our scale for hypochon- 
driacal delusions. Since only one entry may be made on a scale, the maximum score 
or total number of agreements possible with this pattern is four. Such a score would 
indicate the presence of delusions of influence and hypochondriacal delusions, and 
that ideas of reference and of persecution were present in more than mild degree. 
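
The scoring of a patient against this pattern can be illustrated with a short sketch. The scale values are those given above; the dictionary keys, the function name and the sample patient are hypothetical and serve only for illustration.

```python
# Master pattern for the paranoid factor, using the scale values given in the text.
# The keys are hypothetical labels for the four scales.
PARANOID_PATTERN = {
    "ideas_of_reference": {3, 4},
    "delusions_of_influence": {2, 3, 4},
    "ideas_of_persecution": {3, 4},
    "hypochondriacal_delusions": {2, 3, 4},
}


def pattern_score(ratings, pattern):
    """Number of scales on which the patient's single entry is one of the pattern's values."""
    return sum(1 for scale, accepted in pattern.items() if ratings.get(scale) in accepted)


# A hypothetical patient with one entry on each scale.
patient = {
    "ideas_of_reference": 4,
    "delusions_of_influence": 2,
    "ideas_of_persecution": 1,
    "hypochondriacal_delusions": 3,
}
score = pattern_score(patient, PARANOID_PATTERN)
```

This patient agrees with the pattern on three of the four scales, the entry for ideas of persecution falling outside the pattern.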

Other patterns were made up for schizophrenic disorganization, for catatonic 
withdrawal, and for the mild schizophrenic picture to which the name ambulatory 
schizophrenia is commonly given. Patterns were also prepared for manic-depressive 
psychosis, manic phase, for manic-depressive psychosis, depressive phase, and for 
the agitated depression characteristic of involutional melancholia. 

These patterns covered, with one exception, all the syndromes in our material 
which might be inferred from the results of a direct factor analysis. Resistiveness 
and assaultiveness are closely associated, and characterize a very large fraction of 
the patients for whom the operation of prefrontal lobotomy is performed in our hos- 
pitals. It seemed of interest to determine to what extent they might form a separate 
clinical group, and a preliminary pattern of resistive-assaultive behavior was ac- 
cordingly prepared. 

The rating pattern of each of the 423 patients studied was punched on an IBM 
card. Each of the eight patterns prepared was also punched on a card. By means of 
a card punch scorer, each card was scored on each of the eight patterns. A score con- 
sisted of the total number of agreements between the patient’s rating pattern and a 
master pattern. The distribution of scores was obtained and cutting points were 
established toward the higher end. The cutting points were selected so as to mini- 
mize the number of cases falling into more than one group. 

Those cases falling above the cutting point on one and only one pattern were 
utilized for the preparation of a second set of patterns. For inclusion of a scale step 
in the new patterns, the following criteria were set up: 


(1) The number in the scale step must exceed expectancy by 33%. (The entry 
“unrateable” was treated as a scale step.) 

(2) If more than one scale step exceeded expectancy by 33%, that which showed 
the greatest excess was selected, together with any of the others which were 
contiguous with it. Two non-contiguous scale steps falling more than 33% 
above expectancy were made contiguous by inclusion of intervening scale 
steps if the total in all cells so made contiguous exceeded expectancy by 33%, 
but this rule was not applied if two contiguous steps were already included. 
The entry unrateable was considered not to be contiguous with any other 
scale step. Its selection automatically excluded any other items, and vice 
versa. If the unrateable step had the highest excess over expectancy, yet had 
less than one-half of the cases in the particular type group, a new comparison 
was made. The expectancy for the remaining steps was recalculated with 
the unrateable category and the cases in it excluded. If one or more of these 
steps exceeded this recalculated expectancy more than the unrateable cate- 
gory exceeded its expectancy, the rateable scale steps were included in place 
of the unrateable category under the usual rules regarding contiguity. 


The procedure followed may be illustrated by means of the example shown 
below: 


                                                        SCALE STEPS 

Measure                                           1      2      3      4      5 

Total group frequencies 
Expectancy for 25 cases (figures for total 
  group multiplied by 25/423) 
Observed frequencies for scale 
Observed frequencies expressed in per cent 
  of expectancy                                  29    158    100    133    686 

The number in the total sample rated at each scale step is shown in the first line. 
The second line presents the expected frequency for each scale step when the group 
above the cutting point consists of 25 cases. The third line presents the actual ob- 
served frequencies at each scale step of the group defining the type pattern. The 
last line expresses the observed frequencies in terms of per cent of the expected 
frequency. Steps 2, 4, and 5 all exceed the expected values by our criterion of 
33 percent. Step 5 shows the greatest excess and step 4 is contiguous with it, so 
both were included. Their inclusion prevented the inclusion of the contiguous steps 
2, 3, and 4, which would otherwise have been included, as their combined total 
exceeds expectancy by over 33%. 
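
The selection rule can be sketched as follows. This is a simplified reading of criteria (1) and (2), not the authors' procedure as run on punched cards: the special handling of the unrateable entry is omitted, a ratio of 1.33 or more is assumed to satisfy the 33% criterion, and all frequencies in the example are hypothetical, since the figures of the original table are not available.

```python
def select_steps(total_freq, observed, n_total, threshold=1.33):
    """Return the 1-based scale steps to include in a pattern.

    total_freq: frequency of each step in the whole sample
    observed:   frequency of each step in the group above the cutting point
    n_total:    size of the whole sample
    """
    n_group = sum(observed)
    # Expectancy: total-group figures scaled down to the size of the group.
    expected = [t * n_group / n_total for t in total_freq]
    ratio = [o / e if e > 0 else 0.0 for o, e in zip(observed, expected)]
    candidates = [i for i, r in enumerate(ratio) if r >= threshold]
    if not candidates:
        return []
    # Start from the step with the greatest excess over expectancy.
    best = max(candidates, key=lambda i: ratio[i])
    chosen = {best}
    # Add candidate steps that are contiguous with it, in both directions.
    for direction in (-1, 1):
        i = best + direction
        while i in candidates:
            chosen.add(i)
            i += direction
    # Bridging rule: applied only if two contiguous steps are not already included.
    if len(chosen) == 1:
        for other in candidates:
            if other == best:
                continue
            lo, hi = sorted((best, other))
            span = range(lo, hi + 1)
            if sum(observed[j] for j in span) >= threshold * sum(expected[j] for j in span):
                chosen.update(span)
                break
    return sorted(i + 1 for i in chosen)


# Hypothetical frequencies for a five-step scale (423 patients, group of 25).
total_group = [170, 50, 85, 76, 42]
group_of_25 = [3, 5, 5, 6, 6]
selected = select_steps(total_group, group_of_25, n_total=423)
print(selected)
```

With these made-up figures steps 2, 4, and 5 each reach the threshold, and the rule keeps steps 4 and 5 while leaving out step 2, which mirrors the outcome described in the worked example.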

Each scale was thus examined for a particular group above the cutting point on 
a pattern. Scale steps on any scale that satisfied the criteria described above were 
used to define the subsequent pattern for the group. 

From such a beginning, we proceeded toward the clearer definition of six pat- 
terns of morbid tendency and one pattern which proved to be a pattern of normality. 
The cases were scored on the newly derived pattern #2, cutting points were estab- 
lished, and those cases falling above the cutting point on one and only one pattern 
were utilized for the definition of the next version of the pattern. This process was 
repeated. This procedure tended to promote the separation of the patterns by the 
use of ‘‘pure’’ groups, that is, groups of cases which fell above the cutting point on 
only one pattern. However, while most of the patterns tended to show progressive 
separation of their groups above the cutting point with successive patterns, the re- 
sistive-assaultive pattern formed a notable exception in that its overlap increased 
despite use of only ‘‘pure’’ cases. 
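
The selection of “pure” groups at each stage can be illustrated with a short sketch. It assumes that a case counts as above the cutting point when its score is at least the cutting value (the text does not say whether the boundary is inclusive), and the case identifiers, pattern names, and scores are hypothetical.

```python
def groups_above_cut(scores, cutting_points):
    """Map each pattern to the set of cases scoring at or above its cutting point."""
    return {
        pattern: {case for case, by_pattern in scores.items()
                  if by_pattern.get(pattern, 0) >= cut}
        for pattern, cut in cutting_points.items()
    }


def pure_groups(scores, cutting_points):
    """Keep, for each pattern, only the cases above the cut on that pattern alone."""
    above = groups_above_cut(scores, cutting_points)
    pure = {}
    for pattern, cases in above.items():
        others = set().union(*(c for p, c in above.items() if p != pattern))
        pure[pattern] = sorted(cases - others)
    return pure


def overlap_count(scores, cutting_points):
    """Number of cases falling above the cutting point on more than one pattern."""
    above = groups_above_cut(scores, cutting_points)
    return sum(1 for case in scores
               if sum(case in cases for cases in above.values()) > 1)


# Hypothetical scores of four cases on two patterns.
scores = {
    "A": {"paranoid": 4, "manic": 1},
    "B": {"paranoid": 3, "manic": 5},
    "C": {"paranoid": 0, "manic": 6},
    "D": {"paranoid": 1, "manic": 2},
}
cuts = {"paranoid": 3, "manic": 5}
pure = pure_groups(scores, cuts)
overlap = overlap_count(scores, cuts)
print(pure, overlap)
```

Tracking the overlap count from one set of patterns to the next is one way to see whether a pattern is separating from the others or, as with the resistive-assaultive pattern, failing to do so.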

This pattern was abandoned as not constituting a separable psychiatric syn- 
drome, but rather only a type of behavior likely to occur either in schizophrenic 
excitement or in catatonic withdrawal. Cases overlapping this pattern were no 
longer excluded from other ‘“‘pure’’ cultures. 


RATIONALE OF METHOD 


It is worth noting that with the limited size of the population with which we 
were dealing, and the placement of cutting points toward the end of the scale, chance 
factors were bound to be important in determining the inclusion or non-inclusion of 
particular scale steps in the pattern. However, when as in the present study, the 
number of scales is fairly large, this fact constitutes an important stabilizing ele- 
ment. It is worthy of note, furthermore, that the solution achieved by this method 
is not totally independent of the point of beginning. While it is true that no well- 
defined cluster should escape the careful investigator using this method, and no false 
cluster or artefact should be obtained, still, the precise definition of the pattern for 
an important and well-defined cluster may be expected to vary slightly depending 
upon the taking-off point. In this sense the solution achieved is not unique. 

The analogy might be made that the procedure is essentially like rolling a ball 
down a slope of a depression to determine a low point. The point at which the ball 
is released and the pockets in the bottom of the depression may determine the pre- 
cise place at which it comes to rest—unless, indeed, there are no such pockets, or, in 
our analogy, no subclusters within the main one. 

Furthermore, the method is, of course, dependent upon the population to which 
it is applied. However well-marked a diagnostic group, if it is too poorly represented 
in the experimental population, it will not come out as a type. For example, if we 
had a very few cases of paresis in the population, it would be very difficult to define a 
paretic type by this method since the method depends upon statistical tendencies. 
Furthermore, the differentiation can, of course, proceed only in terms of the material 
included. If a condition is best differentiated in terms of Argyll-Robertson pupils, 
slurred speech, tremor, serological and colloidal gold reactions, as is the case in 
paresis, then these elements should be included in the data utilized for type-tracking. 


DEFINITION OF TYPES 


The increasing definition of types is evident in the evolution of the patterns. 
For the six patterns which went through all stages (resistive isolation, schizophrenic 
disorganization, paranoid stabilization, ambulatory schizophrenic, manic, and 
agitated depression), the initial patterns were based on a range of from four scales 
for the paranoid to 39 for the ambulatory schizophrenic, the average being 15.5. The 
patterns #3 involved a range of from 39 to 53 scales with an average of 44.7. New 
entries were included and old entries dropped out. 

Certain developments are of interest such as the fact that the pattern of the 
ambulatory schizophrenic quickly evolved into a pattern of normality. While this 
had not been anticipated, it is obvious that in a mental hospital population, that 
which distinguishes the ambulatory schizophrenic is his relative normality. Pursuit 
of this difference leads into a pattern which is that of a normal person except for the 
scale of insight in which the pattern centers on the realization of having a mental 
illness. Of course, for the psychotic in a mental hospital this is the scale entry re- 
flecting the greatest degree of normality. It is worth emphasis that those individuals 
falling above the cutting point on this scale are not normal. They are simply dis- 
tinguished by being more nearly normal in their clinical picture than the population 
as a whole. The fact that a pattern of normality can be found by contrasting this 
group with those more seriously disturbed reflects how dynamic the tendency to 
normal homeostasis really is. 

Most of the groups above the cutting point stabilized fairly well by the third 
pattern. Tetrachoric correlations between the selection of cases by patterns #2 
and patterns #3 were all above .97 except for the pattern of schizophrenic 
disorganization, which was .92. 

Since the pattern of panicky agitation was now reasonably well defined, and since 
it is well known that the picture of the depressed phase of the manic-depressive 
psychosis is separable clinically, another effort was made to define this pattern in 
our material, even though the group was not well represented. For this purpose, all 
cases were selected showing depression (scale 13, steps 1 or 2), feelings of 
unworthiness (scale 38, steps 2, 3 or 4), and high emotional reactivity (scale 12, 
steps 2, 3 or 4). 

A new pattern for mournful depression, prepared from this material by judg- 
ment rather than rigid rule, gave 24 cases above the cutting point. Two of these 
overlapped with the cases above the cutting point on the third pattern of agitated 
depression, two with the pattern of normality and one with the pattern of manic 
excitement. 

A fourth set of patterns was prepared on the basis of the cases above the cutting 
point on the third patterns and the reintroduced pattern of mournful depression. 
For the inclusion of a cell in this pattern, the requirements were that the cases in the 
cell should be at least 100% more than expectancy, or they should be all in that one 
cell, or all but one in one cell and that one in six (unrateable). Exception was made 
for the pattern of normality which, being defined by the absence of pathology, needed 
a more liberal allowance. Accordingly, only 50% more than expectancy was required, 
rather than 100% more than expectancy as for the other patterns. 

Since the correlation coefficient for the cases above the cutting points on patterns 
#2 and #3, schizophrenic disorganization, reached only .92, the cases were scored 
on the fourth pattern and a cutting point set for this group. The result in terms of 
stability of group membership is indicated in a tetrachoric coefficient of .97. 


THE FINDINGS 


The patterns derived were named as follows for morbid characteristic and re- 
lated diagnosis:* 


1. The pattern of normality. 
2. The pattern of resistive isolation: catatonic withdrawal. 
3. The pattern of schizophrenic disorganization: schizophrenic excitement and 
   hebephrenic disorder. 
4. The pattern of psychotic reorganization: paranoid stabilization. 
5. The pattern of manic excitement: manic-depressive psychosis, manic phase. 
6. The pattern of mournful depression: manic-depressive psychosis, depressive 
   phase. 
7. The pattern of panicky agitation: agitated depression. 


An examination of the patterns themselves and of the intercorrelation of their 
scores reveals two patterns, schizophrenic disorganization and resistive isolation, 
which represent a severe departure from the normal. There are four patterns (manic 
excitement, panicky agitation, psychotic reorganization and mournful depression) 
which represent, in that order, lesser degrees of departure from the normal. The 
closest resemblance borne by any of these patterns to schizophrenic disorganization 
is that of the paranoid pattern, psychotic reorganization. The closest resemblance 
of any of these to the catatonic pattern is that of mournful depression. The manic 
and mournful depression patterns contrast at points, but the pattern of panicky 
agitation has elements in common with both. The paranoid pattern has elements 
in common with both the manic pattern and the pattern of panicky agitation.* 

A verbal description of the pattern of psychotic reorganization: paranoid 
stabilization may serve as a sample. In the group of patients above the cutting point 
on pattern #3, psychotic reorganization: paranoid stabilization, each of the following 
descriptive elements was represented with a frequency at least twice that of the 
group as a whole. 


Paranoid distortion is of course central. The patient has ideas of reference, 
delusions of influence, ideas of persecution and hypochondriacal delusions. 


Perceptual distortion is prominent. He complains of unreality feelings or manifests 
a delusional conviction that things are changed. His delusional beliefs are 
implausible or even bizarre. He sees visions and hears voices, and is positive of the 
reality of the latter. 


*These patterns are being included in a VA technical bulletin on the use of the 
Multidimensional Scale for Rating Psychiatric Patients, Form for Hospital Use. 


This pattern is not characterized by thinking disorganization and typically the 
patient uses no stereotyped words or phrases. 


The patient is oriented. He is oriented for time, knows who he is and can differ- 
entiate among hospital personnel and patients. 


The patient has an inflated self-esteem. He is markedly boastful, conceited or 
grandiose, and possesses a delusional conviction of having fantastic or grandiose 
power, knowledge, or wealth. 


The patient is slightly obsessive. He admits to having insistent, recurrent thoughts 
but usually can control or stop them. He occasionally thinks of committing a hostile 
criminal act. However, he shows no evidence of feelings of unworthiness. 


The patient is not anxious. He expresses concern only to the degree called for 
by the circumstances. 


The patient has average mood stability. He reacts with adequate feeling, exhibits 
only the expected and usual mood variations, and his behavior is as restrained or 
informal as the situation would demand. 


The patient does not show motor distortion. He shows no manneristic postures or 
movements. 


The patient does not withdraw. He does not wet himself. He will start a conver- 
sation or enter into it readily for an appropriate length of time. He participates in 
and shows normal curiosity and interest in the activities provided on the ward and 
elsewhere. 


The patient is somewhat hostilely assertive. He expresses as much as or more 
hostility than would ordinarily be expected. His use of profane and obscene language 
is of ordinary frequency. He is assertive, seeking to influence or dominate others. 
Occasionally he may assault others. 


The patient is not distinguished by resistiveness, although his delusions typically 
interfere moderately with his hospital adjustment, and may limit his freedom. 


SUMMARY 


A method of type-tracking is described, together with its application to date of 
54 rating scales on 423 male psychotic patients in Veterans Administration hospitals. 
We believe this method to be useful. The result obtained is not, however, entirely 
independent of the starting points. 

Application of the method resulted in the isolation of types from which the 
following patterns were evolved: (1) the pattern of normality, (2) the pattern of 
resistive isolation: catatonic withdrawal, (3) the pattern of schizophrenic 
disorganization: schizophrenic excitement and hebephrenic disorder, (4) the pattern 
of psychotic reorganization: paranoid stabilization, (5) the pattern of manic 
excitement: manic-depressive psychosis, manic phase, (6) the pattern of mournful 
depression: manic-depressive psychosis, depressive phase, and (7) the pattern of 
panicky agitation: agitated depression. 

Such patterns can be used in scoring the Multidimensional Scale for Rating 
Psychiatric Patients, a development from the Northport Record. 


Current reports (1, 2, 3, 5, 6, 8) indicate considerable disagreement over the status 
of the Machover DAP technique“. This research is designed to investigate the 
validity of the Machover DAP method as it is clinically used for the description of 
adult personality. 

POPULATION 

The population consisted of 31 neuropsychiatric patients in an Army NP center 
in Korea. 

PROCEDURES 


The patient sample was selected from all patients entering the center during a 
two week period. During this period 76 patients were admitted, but 45 were ex- 
cluded because they failed to meet the selection criteria such as English language 
comprehension, literacy, physical ability to take tests and willingness to undertake 
testing. 

The morning of the day following admission of the patient, his physician and 
wardmen were notified of his inclusion in the study. At this time both his psychia- 
trist and chief wardman were provided with a rating scale (Figures 1 and 2) to be 
used in the description of the patient’s behavior. All such ratings were to be made 
on or about the fourth day of the patient’s stay. All ratings were made independent- 
ly; psychiatrists and wardmen were not informed of each other’s findings nor of the 
psychological findings until following the completion of the rating scales. The 
psychologists also employed the same scale. Psychological tests were administered 
by technicians on the third day; tests requiring individual administration were given 
in the morning, group tests in the afternoon. The psychological battery¹ was 
interpreted blindly by two psychologists who employed pooled judgments in their 
interpretation of the battery. They had not administered the tests and were unaware 
of the findings of the other raters. 

The DAP was included in the morning test administration. The DAP’s were 
separated from other tests and blindly interpreted by the psychologists. The DAP’s 
were identified only by code number, as were the other tests.* Once again the 
interpretations were pooled. 


FIGURE 1. TRAITS ON THE PSYCHIATRIST AND PSYCHOLOGIST CHECK LIST 


 1. Drive to social participation 
 2. Tendency to social withdrawal 
 3. Problems in handling aggression 
 4. Dominance 
 5. Under control 
 6. Over control 
 7. Anxiety 
 8. Insecurity 
 9. Dependent 
10. Passivity 
11. Mother rejection 
12. Tendency to identification with opposite sex 
13. Maturity 
14. Infantile and immature 
15. Other defenses 
16. Overconcern with bodily processes and imagery 
17. Narcissism 
18. Disturbed interpersonal relationships, non-specific 
19. Poor reality contact 
20. Flattened, inappropriate affect disturbance 
21. Manic 
22. Guilt and depression 
23. Sadism 
24. Inversion 
25. Delinquent heterosexual behavior 
26. Auto-erotic disturbances 
27. Infantile, immature sexuality 
28. Sexual inadequacy problems 
29. Castration fears 
30. Non-specific sexual distress 
31. Poor integration-personality disorganization 
32. Organic damage 
33. Hysteria 
34. Alcoholism 
35. Defective intelligence 
36. Paranoid tendencies 
37. Schizophrenic tendencies 
38. Psychopath 


¹MMPI, Sentence completion, word association, group Rorschach, Shipley-Hartford. 

The behavior rating scale employed by psychiatrists, chief wardmen and 
psychologists was based upon a content analysis of Machover’s book. The rating 
scales included 38 personality characteristics which Machover contended could be 
assessed by means of the human figure drawing. The content analysis involved the 
recording of each drawing feature which was considered interpretable. Drawing 
features which involved primarily subjective judgments and which could not be 
related to specific graphic features of the drawing were excluded, e.g. ‘furtive and 
suspicious eye’’.‘?. P- #8) The rating scale represented a condensation of all the 
possible interpretations into related and consistent general personality character- 
istics, including the specific sub-areas of personality specified in Machover. 

The rating scale of the wardmen differed in some respects from the scale used 
by psychiatrists and psychologists. Their rating scale (23 items) redefined the 
psychological terminology of Machover in more operational terms and represented 
overt behavior where it was felt the corpsmen would have an opportunity to ob- 
serve and therefore judge. Prior to using the rating scales the terminology and be- 
havior listed thereon were discussed with all participating corpsmen and psychia- 
trists. 


FIGURE 2. TRAITS ON WARDMEN’S CHECK LIST 


 1. Aggression 
 2. Under control 
 3. Over control 
 4. Anxiety 
 5. Insecurity 
 6. Dependent 
 7. Passivity 
 8. Maturity 
 9. Infantile and immature 
10. Overconcern with bodily processes and image 
11. Narcissism 
12. Disturbed interpersonal relationships, non-specific 
13. Poor reality contact 
14. Flattened affect, inappropriate 
15. Guilt and depression 
16. Sadism 
17. Delinquent heterosexual behavior 
18. Inversion 
19. Alcoholism 
20. Defective intelligence 
21. Paranoid tendencies 
22. Psychopath 
23. Manic 


Each personality characteristic was to be rated on a three-point continuum: 
minimal, moderate, or maximal presence. 

The techniques employed in patient ratings using the DAP’s by Machover’s 
interpretive system involved assigning the rating of minimal to any of the person- 
ality characteristics for which no interpretable feature was found in the patient’s 
drawing. A rating of moderate was assigned to any personality characteristic which 
had one or two corresponding drawing features present in the human figure; and a 
rating of maximal was assigned to any personality characteristic which had three 
or more presence indicators in the drawing. 
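
The rating rule just stated reduces to a count of drawing features per personality characteristic. A minimal sketch follows; the function name is hypothetical, and only the thresholds come from the text.

```python
def dap_rating(n_features):
    """Convert the number of corresponding drawing features into a three-point rating."""
    if n_features < 0:
        raise ValueError("feature count cannot be negative")
    if n_features == 0:
        return "minimal"   # no interpretable feature found
    if n_features <= 2:
        return "moderate"  # one or two corresponding features
    return "maximal"       # three or more indicators


ratings = [dap_rating(n) for n in (0, 1, 2, 3, 5)]
print(ratings)
```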

The psychologists also completed a rating scale for each patient on the basis of 
their own impressionistic technique for DAP interpretation, apart from the prin- 
ciples outlined by Machover. 

This procedure then provided for five ratings for each patient: (1) a rating based 
on the DAP test using Machover’s interpretive system, (2) a rating based on the 
DAP using the psychologist’s own interpretive system, (3) a psychiatrist’s rating, 
(4) the chief wardman’s rating, and (5) a rating based on the blind interpretation 
of a battery of psychological tests by the psychologists. 


RESULTS 


The data were categorized into fourfold contingency tables. The ratings of 
“moderate” and “maximal” were combined and contrasted to “minimal” to form a 
present-absent dichotomy. Fisher’s test of association“) was employed, utilizing 
Yates’ correction. 
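
The construction of a fourfold table from two raters' judgments on one characteristic can be sketched as below. The text names Fisher's test with Yates' correction and later speaks of chi-square tests; this sketch assumes the chi-square statistic with Yates' continuity correction and one degree of freedom, which is one reading and not necessarily the computation actually performed. The function names and the example ratings are hypothetical.

```python
import math


def is_present(rating):
    """Collapse the three-point rating into a present-absent dichotomy."""
    return rating in ("moderate", "maximal")


def fourfold(ratings_a, ratings_b):
    """Return cells (a, b, c, d): both present, A only, B only, both absent."""
    a = b = c = d = 0
    for ra, rb in zip(ratings_a, ratings_b):
        pa, pb = is_present(ra), is_present(rb)
        if pa and pb:
            a += 1
        elif pa:
            b += 1
        elif pb:
            c += 1
        else:
            d += 1
    return a, b, c, d


def percent_agreement(a, b, c, d):
    """Percentage of patients on whom the two raters agree (present or absent)."""
    return 100.0 * (a + d) / (a + b + c + d)


def yates_chi_square(a, b, c, d):
    """Chi-square with Yates' correction and its p-value for one degree of freedom."""
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        return 0.0, 1.0  # a margin is empty; no association can be tested
    chi2 = n * max(abs(a * d - b * c) - n / 2, 0) ** 2 / denom
    p_value = math.erfc(math.sqrt(chi2 / 2))  # upper tail of chi-square, 1 d.f.
    return chi2, p_value


# Hypothetical ratings of four patients by two raters on one characteristic.
rater_a = ["minimal", "moderate", "maximal", "minimal"]
rater_b = ["minimal", "minimal", "maximal", "moderate"]
cells = fourfold(rater_a, rater_b)
agreement = percent_agreement(*cells)
chi2, p_value = yates_chi_square(*cells)
print(cells, agreement, chi2, p_value)
```

Percent agreement and the test of association answer different questions: two raters who both mark nearly every patient “absent” will agree often without showing any association.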


*Code numbers for the DAP differed from those of the battery. 

Table 1 presents the overall percentage of agreement-disagreement as well as 
the significance of association of agreements for the Machover DAP with the psycho- 
logical battery, the psychiatrist’s rating and the chief wardman’s rating. It also 
presents similar figures for the psychologists’ impressionistic DAP interpretation 
with ratings by the psychological battery, by the psychiatrists and the ratings by 
the wardmen as well as a comparison with the Machover DAP scheme. Table 1 
also presents the data on agreements among the psychological battery, the psychia- 
trists’ ratings and the wardmen’s ratings. 


TABLE 1. COMPARISON OF DIFFERENT RATING PROCEDURES 

                                               No. of 
                                               Personality      Average 
                                               Characteristics  Percent    Significant 
Summarized Data                                Rated            Agreement  Association 

Machover DAP ratings compared with ratings 
  by a battery of psychological tests                           56         None 

Machover DAP ratings compared with ratings 
  by psychiatrist                                                          “Sadism” beyond .01 

Machover DAP ratings compared with ratings 
  by chief wardmen                                                         None 

Machover DAP ratings compared with 
  impressionistic DAP ratings                                              “dominance” .01 
                                                                           “manic” .01 

Impressionistic DAP ratings compared with 
  ratings by the psychological battery                                     “organic” .05 

Impressionistic DAP ratings compared with 
  ratings by psychiatrist                                                  “mother rejection” .05 

Impressionistic DAP ratings compared with 
  chief wardmen’s ratings                                                  None 

Psychiatrists’ ratings compared with ratings 
  by a battery of psychological tests                                      “overconcern with bodily 
                                                                           processes and imagery” .05 
                                                                           “flattened, inappropriate 
                                                                           affect disturbance” .05 

Chief wardmen’s ratings compared with 
  ratings by a battery of psychological tests  23                          None 

Psychiatrists’ ratings compared with ratings 
  by chief wardmen                             23               51.5       None 

An examination of results indicates that only 7 out of 320 χ² tests are significant 
beyond the .05 level. Because of the distribution of χ², we cannot be assured that our 
observed significant associations are due to any but chance factors. Little credence 
can be given, therefore, to the few significant agreements among the rating scales. 
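
The force of this caution can be seen from a simple chance calculation. The sketch below rests on assumptions not stated in the text: that the 320 tests break down as six comparisons on 38 characteristics plus four wardmen comparisons on 23 (a breakdown that happens to total 320), and that the tests are independent with a 5% false-positive rate each, which tests on the same patients are not.

```python
from math import comb

ALPHA = 0.05
# Assumed breakdown: 6 comparisons x 38 traits + 4 wardmen comparisons x 23 traits.
n_tests = 6 * 38 + 4 * 23

# Number of "significant" results expected if no real association existed anywhere.
expected_by_chance = ALPHA * n_tests


def prob_at_least(k, n, p):
    """Binomial probability of k or more successes in n independent trials."""
    return 1.0 - sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k))


# Chance of seeing 7 or more significant tests under the null assumption.
p_seven_or_more = prob_at_least(7, n_tests, ALPHA)
print(n_tests, expected_by_chance, p_seven_or_more)
```

Under these assumptions about 16 significant results would be expected by chance alone, so the 7 observed give no grounds for claiming real agreement.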


RANK DATA 


Because of the lack of significant association observed among the ratings made 
by the Machover DAP as compared with other rating procedures, and because of the 
low agreement among these other rating procedures themselves, it may be well to 
present our data in another manner. 

Table 2 presents the rank order of the various rating procedures employed, 
ranked on the basis of the over-all percentage of agreements between any two rating 
procedures on all patients for all personality characteristics rated. 

TABLE 2. SUMMARIZED DATA: RANK ORDER OF RATING PROCEDURE 
COMPARISONS BY AVERAGE PERCENTAGE OF AGREEMENTS 

 1. Psychological battery - psychiatrist 
 2. Machover DAP - impressionistic DAP 
 3. Impressionistic DAP - psychological battery 
 4. Impressionistic DAP - psychiatrist 
 5. Machover DAP - psychological battery 
 6. Machover DAP - psychiatrist 
 7. Psychological battery - wardmen 
 8. Impressionistic DAP - wardmen 
 9. Psychiatrist - wardmen 
10. Machover DAP - wardmen 





Table 3 ranks each of the specific personality characteristics which have been 
rated, in order of the average agreement upon them by the rating procedures. Thus 
Rank 1 in this table, “anxiety,” represents the specific personality characteristic 
which was agreed upon most often by the various comparisons of psychiatrist, 
psychological battery, and the two forms of DAP interpretation. In view of the 
consistently low agreement of the wardmen with these other rating procedures and in 
view of the revised definitions on the wardmen’s rating scale, the data from the 
ratings by the wardmen compared with other rating procedures were not included 
in this table. 


TABLE 3. AVERAGE PERCENT OF AGREEMENT ON ALL PERSONALITY CHARACTERISTICS 
RATED BY ALL COMPARISONS OF PSYCHIATRIST, PSYCHOLOGICAL BATTERY, 
IMPRESSIONISTIC DAP AND MACHOVER DAP COMPARISONS. (RATINGS COMPARISON BY 
WARDMEN WITH THESE OTHER PROCEDURES NOT INCLUDED.) 

Personality Characteristic                             Overall Average Percent Agreement 

Anxiety 
Problems in handling aggression 
Sadism 
Non-specific sexual distress 
Other defenses 
Organic damage 
Overconcern with bodily processes and imagery 
Infantile and immature 
Manic 
Infantile immature sexuality 
Insecurity 
Defective intelligence 
Alcoholism 
Castration fears 
Poor integration-personality disorganization 
Mother rejection 
Tendency to social withdrawal 
Dependent 
Dominance 
Paranoid tendencies 
Flattened inappropriate affect disturbance 
Guilt and depression 
Inversion 
Delinquent heterosexual behavior 
Psychopath 
Drive to social participation 
Maturity 
Poor reality contact 
Schizophrenic tendencies 
Auto-erotic disturbances 
Hysteria 
Over control 
Passivity 
Tendency to identification with the opposite sex 
Under control 
Disturbed interpersonal relationships, non-specific 
Narcissism 

The rankings presented in Tables 2 and 3 were not subjected to statistical tests. 
Because of the relatively slight numerical differences between ranks we must be 
cautious in their interpretation. 


DISCUSSION 

A review of our results indicates that the Machover DAP does not agree to any 
significant extent with any of the other commonly employed clinical methods for 
describing personality and dynamics. The personality characteristics on which there 
is no agreement are those which the Machover technique purports to describe, as 
derived from a content analysis of her book. 

Furthermore the DAP interpretive scheme of the clinicians used here, an im- 
pressionistic one which varied widely from that of Machover (as indicated by its 
failure to agree significantly with the Machover method) also failed to agree sig- 
nificantly with any of the other common clinical instruments. It strongly appears 
that the DAP technique, whether interpreted by means of the Machover or by our 
psychologist’s intuitive method is not a valid indicator of personality dynamics or 
behavioral diagnoses, as measured by ratings by psychiatrists, a battery of psycho- 
logical instruments or by chief ward attendants. 

Further analysis of our data reveals that the DAP is not the only instrument 
which fails to agree with other procedures on the rating of patient behavior. Our 
results show that none of the common clinical instruments used in this study* 
agreed to any significant extent with any other on any of the 38 personality 
variables observed. Thus no one of the frequently employed personality assessment 
devices could be considered “valid” if by validity we mean the extent to which it 
measures what it purports to measure when the absolute standard is conceived as 
any one or several of the other rating procedures employed. The psychiatrist 
agrees neither with the psychological battery nor with the chief wardmen; and, vice 
versa, the wardmen agree with neither the psychologists nor the psychiatrist. In each 
case “agree” here means to a statistically significant extent as measured by a test 
of association. 

In general, the DAP technique is not shown here to be valid as measured. But 
in terms of percentage agreement with other procedures it is but little worse than 
other clinically accepted “standards”. The lack of agreement between DAP 
interpretations and these “standards” appears to be merely one aspect of the wide and 
serious disagreement among the various clinical procedures, which does not speak 
well for the present status of personality description. 

An examination of our percentage data shows that the most frequent agreement 
among instruments or rating procedures or ‘“‘standards’’ occurs between a battery 
of psychological techniques and the psychiatrist; and that the next most frequent 
agreement is between the two schemes for DAP interpretation. We find that the 
chief wardmen are consistently lowest in their agreement with any of the other 
rating devices used. Although these percentages do not reach significance it is 
possible that had the N been larger the trends observed here would have been borne 
out significantly. 

Aside from the size of the N, a number of inadequacies are apparent in the 
study. For one, there was but little training on the rating scale employed, and our 
obtained results may indicate semantic problems in the use of descriptive and diag- 
nostic terminology thereon. The problems of semantics and disagreement over 
commonly used clinical terms can be considered a serious difficulty in itself. 

Particularly the low agreements that wardmen have with other raters may 
represent lack of training for reliability of observation. The reliability problem is 
met in other aspects, too. Five psychiatrists participated in this study; we have no
indication of the extent to which they would agree with each other, nor of their
test-retest reliability with themselves. Similarly we have no data on the reliability of the
psychologists, although an attempt to increase reliability was made through the 
use of pooled judgments. 


*Wardmen’s rating scales, psychologist’s rating scales based on a battery of psychological tests
and psychiatrist’s rating scales. 

One of the possible contributing sources of error is found in the nature of the
neuropsychiatric center where this study was undertaken. All of the study participants
were relatively unused to working together; the center had constant shifts
in personnel, with stays ranging from two months to perhaps one year, thus reducing
the opportunity for the development of clinic group “norms” or a full communality
of approach.

Another criticism may be leveled at the varying degrees of training of the
judging personnel, one aspect of the reliability problem. Nevertheless, these individuals
were functioning in an established and important psychiatric center
participating in the diagnosis and treatment of wide varieties of neuropsychiatric 
disorders. Thus the data here indicate the disagreements which may be found in 
an actual working NP institution. 

Table 3 suggests the areas, either of behavior, terminology or theoretical ex- 
pectancy where agreement is most and least likely to occur. The centrality of most 
of these personality characteristics to the field of clinical and abnormal studies re- 
quires that further attention be paid to the lack of significant agreement upon them 
by working practitioners in the field. 


SUMMARY AND CONCLUSIONS 


Ratings based on the Machover interpretation of the DAP were compared with 
ratings by psychiatrists, a battery of psychological tests and an intuitive DAP 
scheme on 38 personality characteristics in 31 NP patients. On 23 personality char- 
acteristics the Machover DAP ratings were also compared with ratings by chief 
ward attendants. There were no consistent significant agreements of the Machover 
DAP with any of these clinical procedures or “standards” nor were there any significant
agreements of the intuitive approach to DAP interpretation with any of
these clinical rating scales. It was concluded that the DAP was not clinically valid 
as here measured. 

All of these rating procedures were compared with one another for each person- 
ality characteristic and no consistent significant agreements were found. It was 
concluded that there is little agreement among any of the common clinical instru- 
ments for personality description. 

The various personality characteristics rated were discussed in terms of the 
average frequency of agreement upon them. It was found that “anxiety” and
“problems in handling aggression” were most likely to be agreed upon by the psychological
battery, psychiatrists and the two DAP schemes, although the agreements were
not statistically significant. 

The Machover DAP technique has highly questionable validity, but proves to 
be no worse than any of the other common clinical personality assessment procedures. 


THE CONDITIONED REFLEX IN THE CHRONIC SCHIZOPHRENIC
H. N. PETERS AND O. D. MURPHREE 
Veterans Administration Hospital, North Little Rock, Arkansas* 


INTRODUCTION 


The subjects used in the two studies to be reported were test and control patients 
in another, extended investigation of learning or problem solving in the chronic 
schizophrenic © ©. The patients in this learning investigation were put through a 
three-month program. The test patients (Group I) were required to learn progressively
more difficult problems with insulin-stimulated hunger as a motive. Group
II patients were put through exactly the same procedure, with the exception of the 
learning; Group III patients were pure controls, and followed the ward routine. The 
patients, who were sorted into the groups on a random basis, were selected with the 
following criteria: (1) There was no question of the diagnosis of schizophrenia; (2) 
the patients had been hospitalized for a minimum of one year, a maximum of four; 
(3) they were all World War II veterans; (4) with the exception of ward routine, 
they were not cooperating in any of the reality-directed phases of the hospital pro- 
gram. 

Previous papers have described the procedures and results of the first three 
cycles of 12 patients each“. 7). In terms of various measures of clinical status, a 
reliable difference was found favoring Group I. Some of Gantt’s work, which sup- 
ports the conclusion that the CR is a valid indicator of the general clinical condition 
of the schizophrenic patient ®?, clearly points up the probable profit in studying the 
CR in the chronic schizophrenic. The immediate purpose of the studies reported 
herein was to determine whether or not the clinical improvement and effects of the 
learning treatment are reflected in the CR. 


METHOD 


In the present work the psychogalvanic reaction (PGR) was investigated, 
chiefly because of the simplicity of apparatus for measuring the reaction and because 
of the unpredictability of voluntary reactions with non-cooperative patients. An 
ink-recording polygraph and galvanometer, with finger electrodes, were used². The
record was made on paper moving at a fixed rate of approximately 1.8 millimeters 
per second. One ink line showed the changes in level of skin resistance; time of stimu- 
lation was indicated by a parallel stimulus marker. The bridge was balanced at the 
beginning of a sitting at the same sensitivity level for every subject. A Hunter In- 
terval Timer was used to control the time of stimulations. 

Seven normal individuals were put through the conditions of each experiment. 
Although these individuals were roughly comparable in age with the patients, they 
were generally of a higher socio-economic level. They were used only for the purpose 
of determining a rough base for evaluating the performance of chronic schizophrenics. 

The time interval between completion of the major study and the CR investi- 
gations differed in the two experiments. In the case of the patients of the first three 
cycles, who were used in the first experiment, this interval varied from five months 
to a year. The subjects of the next three cycles, who were used in the second experi- 
ment, were tested from several days to five months after the major experiment. 

In experiment #1 the patient was stimulated with red and green lights, each 
shining for 15 seconds. During the training period, the green light (CS) was followed 
immediately by the loud blast of a horn³. The procedure used with each patient
involved 50 random stimulations given during one sitting with brief rest periods after
the 10th, 25th, and 35th stimulations. The first 10 stimulations were with five red
and five green lights without the horn. The next 20 stimulations were with 10 red
lights and 10 green lights, the latter reinforced with the horn. The last series of
stimulations, the testing phase of the set-up, were 10 red lights and 10 green lights.


*Reviewed in the Veterans Administration and published with the approval of the Chief Medical
Director. The statements and conclusions published by the authors are the results of their own study
and do not necessarily reflect the opinion or policy of the Veterans Administration.

²Lafayette Instrument Company, Lafayette, Indiana.

³A very loud horn, called “adaptahorn”, manufactured by Edwards Company, Norwalk, Conn.

In experiment #2 the same red and green lights were used, each shining for a
period of only three seconds. However, a buzzer was sounded at the same time as
the light to attract the patient’s attention. The UCS was an electric shock given on the
palmar surface of the right hand. The pre-training period was of three parts: (1)
Four stimulations with the light alone, in the order G-R-G-R. (2) Adjustment of
the electric shock to a point where a marked and consistent PGR was obtained on
three successive shockings⁴. (3) Four stimulations with the lights alone, in the
order R-G-R-G.


In the training and post-training periods the red and green lights were presented 
in random order. There were four parts to the training, each part consisting of (1) 
five red lights and five green lights, the greens followed immediately by the shock, 
and (2) two reds and two greens without the shock. The post-training period was 
filled with 10 stimulations with the green light alone (extinction). 
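
The two stimulus schedules described above can be tallied phase by phase as a consistency check. The following is only a minimal sketch: the counts are taken from the text, but the data layout and the names `EXPERIMENT_1`, `EXPERIMENT_2`, `tally`, and `training_test_greens` are illustrative assumptions, not part of the original procedure.

```python
# Phase-by-phase stimulus counts taken from the text; the layout and names
# are hypothetical, chosen only for this illustration.
# Each phase is (label, reds, reinforced_greens, unreinforced_greens).

EXPERIMENT_1 = [
    ("pre-training", 5, 0, 5),   # five red and five green, no horn
    ("training", 10, 10, 0),     # greens reinforced with the horn
    ("testing", 10, 0, 10),      # final series without reinforcement
]

# Experiment #2: two pre-training light series of four stimulations each,
# four training parts (5 reds + 5 reinforced greens, then 2 reds + 2
# unreinforced greens), and 10 extinction greens.
EXPERIMENT_2 = (
    [("pre-training a", 2, 0, 2), ("pre-training b", 2, 0, 2)]
    + [("training part %d" % i, 5 + 2, 5, 2) for i in range(1, 5)]
    + [("extinction", 0, 0, 10)]
)


def tally(schedule):
    """Return total light stimulations and total reinforced greens."""
    total = sum(r + g_rf + g_un for _, r, g_rf, g_un in schedule)
    reinforcements = sum(g_rf for _, _, g_rf, _ in schedule)
    return {"stimulations": total, "reinforcements": reinforcements}


def training_test_greens(schedule):
    """Count unreinforced greens presented inside the training phases."""
    return sum(g_un for label, _, _, g_un in schedule
               if label.startswith("training"))


print(tally(EXPERIMENT_1))
print(tally(EXPERIMENT_2)["reinforcements"], training_test_greens(EXPERIMENT_2))
```

Under these assumptions the tally for experiment #1 gives 50 stimulations with 10 reinforcements, and experiment #2 gives 20 reinforcements with 8 unreinforced test greens during training.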

In both experiments the interval between stimulations was variable, depending
upon the time necessary for the individual’s galvanic reaction to become stabilized.
This interval was seldom less than 20 seconds or more than 60. In administering the
stimuli, the experimenter watched the PGR marker and waited until it had returned to
the balanced point (adjusting the bridge if necessary) and was either stable or gradually
moving toward increased resistance (the opposite of the reaction).


Special attention should be called to the differences between these two experiments.
(1) The experimenter was the junior author in the first experiment, the
patients of which were put through the major learning study by the senior author.
The latter was experimenter in the second CR study, the patients of which were put
through the major project by the former. (2) The time interval between the end
of the major project and the CR experiment was much longer on the average in the
first experiment. (3) Different patients were used in the two experiments, but they
were selected in the same manner. (4) The duration of the light stimuli was relatively
long in the first experiment, short in the second. (5) The buzzer which sounded with
the lights in the second experiment probably caused a more consistent attention
value. (6) The nature of the UCS was different in the two experiments. (7) The
distribution of tests (unreinforced CS’s) over the training period was different in
the two experiments. Testing was done only at the end of the training in experiment
#1; it was done after every five training CS’s in #2. (8) The number of
reinforcements was 10 in the first experiment, 20 in the second.

In both experiments the subjects were stimulated for approximately three-quarters
of an hour with green and red lights. The measurement of conditioning used was
the difference between the reaction to green and to red at approximately the same
point in time. Thus any time-linked factors should operate equally on both lights,
and if pairing of the UCS with green had any effect it should be evidenced in greater
reactivity or a shorter latent period. The results presented below are in terms of group
differences in distribution or central tendency of conditioning, defining the latter as
enhancement of reaction to a green in comparison with a red occurring close in time.
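
The green-minus-red measure defined above can be illustrated with a short sketch. The amplitude values below are invented for illustration and are not data from the study; the function name `conditioning_score` is likewise an assumption.

```python
def conditioning_score(green_mm, red_mm):
    """Total reaction to greens minus total reaction to reds, in millimeters.

    For amplitude, a positive score suggests enhancement of the reaction
    to green; for latent time, a negative score would suggest it instead.
    """
    if len(green_mm) != len(red_mm):
        raise ValueError("expected the same number of green and red trials")
    return sum(green_mm) - sum(red_mm)


# Hypothetical amplitudes (mm) for one subject on three paired test trials.
green = [12.0, 9.5, 11.0]
red = [4.0, 6.5, 5.0]
score = conditioning_score(green, red)
print(score)  # 17.0
```

Because each green is compared with a red occurring close in time, slow drifts in the record affect both totals about equally and largely cancel in the difference.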

All measurements are presented in millimeters. No attempt was made to con- 
vert the measure of amplitude into ohms or any other conventional physical measure. 
The ambiguity of PGR measurements is not eliminated by statement in electrical 
terms. Sensitivity for all patients was kept constant and not changed over the ex- 
perimental period; and the authors believe that the method of measuring CR and 
the non-parametric statistical techniques used get around this ambiguity. 


⁴A commercially produced variable transformer was used.


RESULTS


The relative reactivity of different groups to the UCS. The amplitude of a reaction 
was measured as the vertical distance in millimeters from a low point after the be- 
ginning of CS to the highest point within two centimeters of the beginning of CS. An 
increase in height of curve, of course, signals a decrease in skin resistance, the PGR. 

During the training period, when the green light was being reinforced, the ampli- 
tude measure for the green was also a reaction to the UCS, while for the red it was 
only a reaction to the light. Comparison of reactions during the training to green 
and red thus is an indicator of the subject’s reactivity to the UCS. 

Table 1 presents the mean differences in total amplitude of reactions to green and
to red during training.


TABLE 1. MEAN DIFFERENCE IN TOTAL AMPLITUDE OF REACTIONS TO GREEN AND TO RED DURING
REINFORCEMENT OF GREEN (UNIT = MILLIMETER)

                   Exper. #1. (Horn = UCS)*           Exper. #2. (Shock = UCS)**
                       M        σm          N            M        σm
Normals               89.7     30.5         7          282.1     38.9
Group I (Test)        71.8     26.7        10          270.7     71.0
Group II              22.1      9.6        11          247.5     33.6
Group III             55.9     25.0        11          242.6     37.1

*Difference is total reaction to 10 Greens minus total to 10 Reds.
**Difference is total reaction to 20 Greens minus total to 20 Reds.


In both experiments the mean, the size of which indicates enhancement
of reaction by the UCS, is largest for the normals, next in size for the test
patients, with a marked drop to the means for the two control groups. The differences
between the groups were not found to be statistically reliable in the case of
experiment #2. With the experiment #1 means, non-parametric methods of comparing
the distributions showed that the differences were significant between both
the normals and Group I, and the pooled scores of the other two groups. The Mann-
Whitney U Test yields a P of .019 for the difference between the normals and the
two controls, a P of .045 for the difference between the test group and the controls.
Contrasting the number of test patients whose total for green was greater than for
red, with those for whom this was not the case, the respective numbers are 9 and 0;
13 and 10 are the corresponding numbers for the control patients. The difference
between these distributions, using Fisher’s long method of calculation), has a P
of .018.
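
The last comparison above can be reproduced from the four counts given in the text (9 and 0 for test patients, 13 and 10 for controls). The sketch below assumes that the "long method" refers to the direct hypergeometric computation of Fisher's exact test and that the reported P is one-tailed; both are assumptions, and the function name is hypothetical.

```python
from math import comb


def fisher_one_sided(a, b, c, d):
    """One-sided Fisher exact P for the 2x2 table [[a, b], [c, d]].

    Sums the hypergeometric probabilities of every table with the same
    margins whose top-left count is at least `a`.
    """
    row1 = a + b
    col1 = a + c
    n = a + b + c + d
    upper = min(row1, col1)
    numerator = sum(comb(col1, k) * comb(n - col1, row1 - k)
                    for k in range(a, upper + 1))
    return numerator / comb(n, row1)


# Rows: test patients, control patients.
# Columns: green total greater than red, green total not greater.
p = fisher_one_sided(9, 0, 13, 10)
print(round(p, 3))  # 0.018
```

Under these assumptions the computed value agrees with the P of .018 reported in the text.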


The relative amount of conditioning which occurred in the different groups. The 
latent time between the beginning of a CS and the start of an up-swing in the curve 
of resistance is one measure of reactivity to the CS and can be the basis for inferring 
conditioning. This time measurement, in millimeters of horizontal distance on the 
record paper, was determined for every stimulation in both experiments. If this time 
measurement for green lights is found to be smaller than for corresponding reds, 
occurring at approximately the same time, it may be inferred that conditioning is 
taking place in the form of anticipation. 


Table 2 presents the mean differences, total latent time for initial test trials with 
green minus the same for initial reds. The means for the normals and test patients 
are smaller than those for the two controls. The Mann-Whitney U Test of the differ- 
ence between I and the two controls pooled yields a satisfactory level of confidence, 
P = .02. Similar comparison of this difference (green minus red) for all 10 test trials 
also yields a satisfactory level of confidence. 
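
For readers unfamiliar with the statistic, the U value underlying the Mann-Whitney test can be computed by counting pairs. The sketch below uses invented latency differences, since the individual scores are not given in the text; the names and data are hypothetical, and no P value is derived here.

```python
def mann_whitney_u(x, y):
    """U statistic for sample x: the number of (x, y) pairs in which the
    x value exceeds the y value, with ties counted as one half."""
    u = 0.0
    for xi in x:
        for yj in y:
            if xi > yj:
                u += 1.0
            elif xi == yj:
                u += 0.5
    return u


# Hypothetical green-minus-red latency differences (mm), not study data.
test_group = [-9, -6, -2, 1]
controls = [-3, 2, 5, 8, 11]

u_test = mann_whitney_u(test_group, controls)
u_controls = mann_whitney_u(controls, test_group)
print(u_test, u_controls)  # 2.0 18.0
```

The two U values always sum to the product of the sample sizes (here 4 x 5 = 20); the smaller one is the value ordinarily referred to published tables of U.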

TABLE 2. MEAN DIFFERENCE, GREEN MINUS RED, IN TOTAL REACTION TIME FOR THE FIRST THREE
POST-TRAINING TRIALS. (UNIT = MILLIMETER)

                        Experiment 1
Groups                    M        σm
Normals                  +1.4      5.7
Group I (Test)           -5.1      6.4
Group II                 +3.4      3.4
Group III                +9.6      4.2

This time measurement does not show a significant trend in experiment #2. It 
is possible that the shorter duration of CS in this experiment is not sufficient for 
conditioning to evidence itself in this form. 

With the amplitude measure of PGR, of course, the green minus red difference
should be in the positive direction to infer conditioning; and, in comparing groups,
the larger this average difference, the more the evidence of conditioning. Table 3
presents the mean differences for the four groups in both experiments. The means
for normals in both are relatively large, compared with II and III. In the second
experiment the mean for Group I is markedly larger than for the two control groups.
The Mann-Whitney U Test indicates a high level of confidence for the differences
between normals and the two controls. This statistical test yields a P of .044 for the
I versus II and III difference in experiment #2.


TABLE 3. MEAN DIFFERENCE IN TOTAL AMPLITUDE OF REACTION TO GREEN AND
TO RED FOR CERTAIN TEST TRIALS, WITHOUT REINFORCEMENT OF GREEN.
(UNIT = MILLIMETER)

                       Experiment 1*           Experiment 2**
Groups                  M        σm            M            σm
Normals                21.7      9.7       [illegible]      7.4
Group I (Test)         -1.6      4.1          -1.1          7.4
Group II               -0.8      4.1         -14.1          9.4
Group III               5.8      3.5         -10.4          9.4

*Difference is total reaction for first three test Greens minus same for first three Reds.
**Difference is total reaction for 8 test Greens minus same for 8 test Reds.


The differences in conditions of the two experiments are for the most part ones 
likely to favor more conditioning in experiment #2. Probably the most important 
of these are the attention-demanding buzzer which sounded with the lights in experiment
#2, the shorter duration of CS, and the noxious nature of the UCS in the
second experiment. 


SUMMARY AND CONCLUSIONS 


The results of two experiments, with different patients, experimenters, and 
conditions, agree in showing (1) greater reactivity of the PGR to the UCS in normals 
and in chronic schizophrenics who have been through three months of problem 
solving than in chronic schizophrenics who did not receive the learning treatment; 
and (2) that normals and treated chronic schizophrenics are more readily conditioned 
than are untreated chronic schizophrenics. 

Since evidence presented in a previous paper indicates that the test patients
were in better clinical condition than the controls, and of course the normals may be 
safely assumed to be so, these results bear out Gantt’s positive finding that CR is an 
indicator of clinical condition @?. 

These results are also in agreement with Pavlov’s “pathological inertia” hypothesis
of the nature of schizophrenia“. The relatively low level of reactivity of the
chronic schizophrenic’s PGR to UCS and the low conditionability are evidence of 
the characteristic rigidity or inertia of the schizophrenic. Furthermore, the results 
of these studies suggest that this inertia occurs on the reflex as well as the cortical 
level. 

In the opinion of the writers the major significance of the results of these condi- 
tioning studies lies in the support they lend an hypothesis about cortical functioning 
in chronic schizophrenia. If a part of chronic schizophrenia is a condition of the 
cortex approximating functional decortication, then a prolonged period of activity 
which forces the patient to use his cortex—such as the learning problems used with 
the test patients here—should to some extent improve the functioning of the cortex. 
Assuming that the CR is an indicator of the level of this cortical functioning, the 
present results can be interpreted as verifying this hypothesis. 


EVALUATION OF TOPICS IN THERAPY GROUP DISCUSSION*
GEORGE A. TALLAND** AND DAVID H. CLARK 
Institute of Psychiatry, Maudsley Hospital, London. 


INTRODUCTION 


The literature of group therapy, though extensive, includes few systematic 
studies of the topics of discussion. Reports tend to deal with overt content only in 
relation to inferred needs and motives. Frank “? discussing the problem of research 
in this field advocated the study of the “more or less permanent issues about which
patients are concerned and about which they are trying to do something”, the study
of themes perceived by the therapist though not necessarily by the patients them- 
selves. One research included a time sampling analysis of a list of topics, but the 
authors dismissed the findings obtained by this method as being of minor importance 
in comparison with their subjective impressions of what the group discussion had 
been about.“ Plank“) made an attempt to tabulate the frequencies of certain 
‘types of utterances’ over a series of meetings at a V. A. clinic, but his list, so far from
being comprehensive, omits certain topics precisely because they tended to be raised
at each meeting. 

It is hardly surprising that content analysis in therapeutic group discussion 
should have produced no significant results. On the one hand, psychoanalytically 
oriented therapists tend to hold the view that for the purposes of group psycho- 
therapy any topic is as useful as any other, that it is not what patients say that mat- 
ters, but how or in what context they say it. On the other hand, actuarial reports are 
obviously of limited intrinsic interest, for the composition of the group membership 
determines the choice of topics to a large extent. Plank’s veterans will certainly
talk more about occupational problems, students about religious or philosophical
doubts than a group consisting of six housewives troubled by marital maladjustments.


PROBLEM 


Our object was to investigate the significance of different topics of discussion 
as seen by the patients themselves, and more particularly to test whether they sup- 
port the view held by the so-called dynamic school of psychotherapy, which dis- 
misses the importance of overt content. We predicted, on the strength of casual 
remarks made by group patients, that they would not agree with the dictum that 
any topic is as useful as any other, and formulated also several additional hypotheses 
to test some general psychotherapeutic propositions: (1) group patients distinguish 
various topics of discussion according to their value in therapy, and generally agree 
in their evaluations; (2) in well integrated groups, patients will not markedly dis- 
tinguish between the relative value of a topic to themselves individually and to the 
group as a whole; (3) topics judged to be disturbing are also thought to be helpful; 
(4) no topic is judged entirely useless, wasting the group’s time or hindering its pro- 
gress; (5) those topics will be considered most helpful, which can be openly talked 
about in the group therapeutic setting but not elsewhere. 


SUBJECTS 


The research was carried out on seven groups of patients undergoing treatment 
at the Maudsley Hospital, London. The groups were conducted according to the 
principles of group analytic psychotherapy described by Foulkes® and Taylor, 
which agree in most essentials with the technique of other psychoanalytically 
oriented group therapeutic procedures, e. g. Slavson’s“. In membership the groups 


*The research of which this study forms a part has been facilitated by a grant from the Bethlem 
Research Fund, made available to one of the authors (G. A. T.) by the Governors of the Royal Beth- 
lem and Maudsley Hospitals, London. 

**Now at Harvard University. 
varied from 5 to 8; some were mixed, others consisted entirely of men or of women.
The subjects were out-patients who were suffering from the various classes of psycho- 
neuroses and in a few instances from less severe psychoses. The age range was from 
17 to 45; of the 17 men and 26 women in the sample, 28 were married. Most of the 
patients were office and shop workers, though a fair proportion of the women had no 
employment outside their homes; they were all of high average or higher intelli- 
gence. The meetings took place in consulting rooms once weekly for periods of 90 
minutes. 


PROCEDURE 


The material for the research was obtained by asking the subjects to rank items 
of a set list of topics on various criteria. It was essential for the purpose of analysis 
to deal with a standard list of topics, though its compilation involved us in attempt- 
ing to satisfy two conflicting demands. The list had to be comprehensive enough to 
be reasonably representative of the varied subject matter raised in therapeutic dis- 
cussion, yet not so wide as to become unmanageable for the task of ranking. It had 
to include most topics central to the average group patient’s problems, but also some 
more peripheral ones so that we could test in its literal sense the statement that ‘any 
topic is as useful as any other’. The final selection was based on the records of some 
forty group meetings and on the advice of several experienced group therapists. The 
topics were defined in words or phrases drawn from the patients’ own vocabulary so 
as to avoid the theoretical preconceptions of any particular psychotherapeutic 
school, and also in order to be certain that the items to be ranked would be all meaningful
to the subjects. The list consisted of the following 15 items: Childhood memories;
dreams; reactions to others in the group; feelings and thoughts about the group
doctor; people outside the group; is illness due to physical or psychological causes?;
marriage problems; money troubles; problems at work; quarrels, angry feelings;
children, bringing up children; symptoms, anxieties; social position, class feelings;
sex; shame, guilt.

Our object was to discover which of these items the patients thought to be 
helpful or unhelpful in discussion, which disturbing, which—if any—to hinder treat- 
ment. Apart from absolute choices we were also interested in the ranks assigned to 
the topics under each heading. Choices were therefore limited to five on each cri- 
terion, and these had to be ranked. Each task of choice and ranking was performed 
separately, first with a view to the group as a whole, and again with a focus on the 
subject himself. Complete rankings of 15 items were constructed for each subject 
by combining the two partial rankings made for ‘helpfulness’ and ‘unhelpfulness’, 
and by taking the remaining items as tied in the mid-rank position. This procedure 
was adopted in order to simplify the task set to the patients, and was justified by 
the very high negative correlations between ranked frequency scales of ‘helpful’ 
and ‘unhelpful’ items. Thus complete rankings were obtained from each patient for 
helpfulness, and partial rankings on the disturbing and hindering effects of the 
topics, all with a view to the group and again with a focus to the subject himself. 
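
The construction of a complete ranking from the two partial rankings, as described above, can be sketched as follows. This is an illustration under stated assumptions: the chosen helpful items take the top ranks in order, the item ranked first for unhelpfulness takes the bottom rank, and all unchosen items share the mid-rank of the positions left over. The code letters and the function name are hypothetical.

```python
def complete_ranking(topics, helpful, unhelpful):
    """Combine two partial rankings into one ranking of all topics.

    helpful:   chosen topics, most helpful first (ranks 1, 2, ...).
    unhelpful: chosen topics, least helpful first (ranks n, n-1, ...).
    Topics chosen on neither task are tied at the mid-rank of the
    remaining positions.
    """
    if set(helpful) & set(unhelpful):
        raise ValueError("a topic cannot be both helpful and unhelpful")
    n = len(topics)
    ranks = {}
    for i, topic in enumerate(helpful):
        ranks[topic] = float(i + 1)
    for i, topic in enumerate(unhelpful):
        ranks[topic] = float(n - i)
    middle = [t for t in topics if t not in ranks]
    if middle:
        first_free = len(helpful) + 1
        last_free = n - len(unhelpful)
        mid_rank = (first_free + last_free) / 2
        for topic in middle:
            ranks[topic] = mid_rank
    return ranks


# Hypothetical code letters for the 15 topic cards.
topics = list("ABCDEFGHIJKLMNO")
ranks = complete_ranking(topics,
                         helpful=["N", "L", "C", "A", "O"],
                         unhelpful=["H", "M", "F", "E", "I"])
print(ranks["N"], ranks["B"], ranks["H"])  # 1.0 8.0 15.0
```

With five choices on each task the five unchosen topics occupy positions 6 to 10 and so share rank 8, and the ranks still sum to 120, as for any complete ranking of 15 items.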

Patients were interviewed while in a group, following a therapeutic meeting. 
The therapist was not present. Scoring sheets with two rows of four columns each, 
blank but for being numbered from 1 to 5, were issued to the subjects with a set of 
15 cards. In the middle of each card was printed one of the 15 topics, and in the 
top right corner its code letter. Subjects selected five cards for each task, placed 
them in rank order, and noted down the code letters in the appropriate column in 
the same order. Introducing them to the task the experimenter pointed out that in 
group therapy the content of discussion was not regarded as being of primary im- 
portance, that their therapists had told the patients to discuss anything they wished 
to bring up and that this would help them no matter what the topic. None the less, 
the experimenter added, they had probably found that discussion of some topics
had been more helpful than that of others. If that were the case, they were asked to
select on the basis of their experience in the group the five topics which had proved 
most helpful in the progress of the entire group. The instruction to consider the progress
of the entire group, and not just their own, was twice stressed and repeated in
the introduction of each subsequent task. In order of performance, these were selec- 
tion and ranking of the five least helpful items out of the remaining batch of ten 
cards, then of the five most disturbing topics from the entire pack, and lastly any 
number of topics up to five which had been not merely unhelpful but actually inter- 
fering with, hindering the progress of the group. On completion of these four tasks 
subjects were asked to fold back the scoring form so that they could not see what 
they had written down, and the entire procedure was repeated, this time each patient 
selecting and ranking the topics with a view to his own progress. Throughout, sub- 
jects were allowed to choose fewer items if they could not think of five fitting the 
category or any number up to ten, and to rank these. 

In order to test the hypothesis that those topics would be judged more helpful 
which could be discussed in the therapeutic setting but not outside, the research in- 
cluded a panel of 35 judges recruited from psychologists working at the Institute of 
Psychiatry, none of whom had had any direct contact with group therapy. Some of 
the topics were slightly re-worded for this purpose so as to have more general terms 
of reference, and the judges were instructed to consider them being discussed as the 
speaker’s personal problem. They had to perform two tasks: first to mark those 
items which in their opinion could be brought up in an intimate setting only, second- 
ly to rank the entire list starting with the topic most emphatically suitable for dis- 
cussion among close friends and ending with that which could be talked about in any 
situation. Judges performed their task individually, working with a printed list and 
not cards. They were asked to break down the ranking into three sections, beginning 
with the top five and ending with the middle five items. 

Hypothesis I was tested by the concordances of the patients’ group-focused
total rankings for helpfulness; hypothesis II by correlating each patient’s group-
focused and self-focused rankings for helpfulness; hypothesis III by tabulating the
association of individual choices for helpfulness and disturbing effect separately on
the group-focused and self-focused tasks, and calculating chi square on the entire
samples; hypothesis IV by inspection of the total number of choices made for
‘uselessness’ or hindering effect; hypothesis V by correlating the judges’ scale for
‘intimacy’ with the patients’ scale for ‘helpfulness’, and by examining the association
between the topics most highly chosen for ‘helpfulness’ and the frequency of their
judgment on ‘intimacy’.


RESULTS 


All subjects were highly co-operative. The patients, with no exception, took 
readily enough to stating their preferences; none regarded himself as unqualified to 
judge the topics for their helpfulness, nor did any one question the meaning or valid- 
ity of the task. A few could not think of five topics which were definitely helpful, 
some more were unable to select as many unhelpful or disturbing items, but many 
of them could not think of several or even one topic which was entirely useless in or 
actually hindered the discussion. 

The patients’ response to the task thus itself confirmed the principal argument 
of our first hypothesis: they all easily distinguished the topics according to their use- 
fulness in therapeutic discussion. They also showed considerable consensus in their 
rankings which gave an average rank correlation coefficient of .32. This consensus 
thus transcends group barriers, and remains significantly higher than could happen 
by chance at the .01 level of confidence even after the exclusion of the four topics 
which are generally judged as least helpful. Concordances of rankings, measured by 
Kendall’s W coefficient, are significant within 6 groups at the .01, in the seventh at 
the .05 level of confidence. Sex and married status account for only minor variations 
in the rank order; the product moment correlation between the summated rankings of
men and women is .82, of married and unmarried patients .86. Correlations based on
frequencies of choices of the items under the two sets of four headings in table 1 
are given below: 

TABLE 1. FREQUENCIES OF CHOICES

Group-focused Self-focused
Topics Unhelp. | Dist. Hind. | Help. | Unhelp.| Dist.

Childhood memories 2 3 14 : 6 11
Dreams : 11 12 : 14 7
Others in group 7 23 7 17
Group Doctor 13 , 10
People outside group , 7
Shame and guilt
Illness physical?
Marriage problems
Money troubles
Problems at work
Quarrels
Children
Symptoms, anxieties
Social position
Sex

Helpful and unhelpful to group     -.95
Helpful and unhelpful to self      -.89
Helpful to group and self           .78
Unhelpful to group and self         .91
Helpful and disturbing to group     .78
Helpful and disturbing to self      .79
Unhelpful and hindering to group    .78
Unhelpful and hindering to self     .53


The high negative correlations between helpfulness and unhelpfulness are a 
most satisfactory proof of the validity of the procedure by which rankings under 
these two headings were combined into a single scale. Correlations between group- 
focused and self-focused rankings of frequencies confirm the findings based on cor- 
relating each subject’s individual complete rankings for helpfulness which averaged 
.59 with a standard deviation of .10. Except for three patients, these coefficients 
were all positive, ranging from .30 to .95. From the high correlations between ranked 
frequencies on helpfulness and disturbing effect, it is evident that there is a general 
tendency to place the same topics either high or low on both criteria. Several patients 
remarked that having chosen and ranked five topics for helpfulness it was unneces- 
sary to put them down again under the heading of ‘disturbing’. Taking the sub- 
jects individually, the topics were tabulated according to frequencies of being 
judged helpful, moderately helpful or unhelpful and disturbing or not disturbing. 
These categories are, of course, relative: helpfulness stands for the most helpful five, 
unhelpful for the least helpful five, moderately for the remaining items. Likewise 
‘not disturbing’ may include some disturbing topics other than the five most upsetting.
The frequency tabulations are given in table 2. Chi square values are significant
at the .001 level of confidence, both for data based on group focused and
self focused choices. It is therefore evident that judgments of helpfulness are not
independent from those made for disturbing effect.


TABLE 2. ASSOCIATION OF CHOICES FOR HELPFULNESS AND DISTURBING EFFECT.

                       Group focused                 Self focused
Topic              Helpf.  Moder.  Unhelpf.     Helpf.  Moder.  Unhelpf.

Disturbing          113      75       12         110      65       18
Not disturbing      102     150      193          95     163      194
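
The chi square test on the frequencies of Table 2 can be retraced with a short sketch. It assumes the ordinary Pearson chi square for a 2 x 3 table, with every choice treated as an independent observation; the source does not say exactly how the authors computed their values, so this is an illustration rather than a reconstruction of their procedure. The function name is hypothetical; the frequencies are those printed in Table 2.

```python
def chi_square(table):
    """Pearson chi square and degrees of freedom for an r x c table of frequencies."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Frequency expected if the two classifications were independent.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    df = (len(table) - 1) * (len(table[0]) - 1)
    return chi2, df


# Rows: disturbing, not disturbing. Columns: helpful, moderate, unhelpful.
group_focused = [[113, 75, 12], [102, 150, 193]]
self_focused = [[110, 65, 18], [95, 163, 194]]

CRITICAL_001_DF2 = 13.816  # tabled chi square for 2 degrees of freedom at the .001 level

chi2_group, df_group = chi_square(group_focused)
chi2_self, df_self = chi_square(self_focused)
print("group focused exceeds critical value:", chi2_group > CRITICAL_001_DF2)
print("self focused exceeds critical value:", chi2_self > CRITICAL_001_DF2)
```

Under these assumptions both values lie far above the .001 critical value, in line with the conclusion stated in the text.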











In selecting topics hindering the progress of treatment the subjects varied widely,
from some who chose five items to others who could not think of one. The latter
numbered 18 judging for the group, 24 judging for themselves individually. Al- 
together 53 entries were made under this heading with the group, 37 with self in 
view, giving averages of 1.23 and .86. These totals rendered a quantitative analysis 
of the data impracticable, and also account for the comparatively low correlation 
between self-focused choices for unhelpfulness and hindering effect, as computed 
from ranked frequencies. 

Table 3 gives the mean ranks of the 15 items computed from the patients’ 
rankings for helpfulness and their mean ranks computed from the judges’ rankings 
for intimacy, as well as the frequencies with which they were assigned to be fit for 
discussion in an intimate setting only. Product moment correlation between the two 
arrays of mean ranks gives a coefficient of .69. Of the five topics judged most helpful 
by the patients, however, only two were regarded as suitable only for discussion in 
an intimate setting, while the other two items so rated, though above the mean, 
are not in the top range of helpfulness. 


TABLE 3. MEAN RANKS OF HELPFULNESS AND INTIMACY

Topics Helpfulness Intimacy
Rank Frequency

Sex .06 : 21
Symptoms, anxieties 18 ‘ 15
Shame and guilt .68 : 30
Childhood memories .79 . 5 1
Quarrels 5.86 . o£ 14
Others in group 6.35 5. 22
Marriage problems 7.94 . 27
Group Doctor 7.45 11
Illness physical?
Dreams
Children
Problems at work
Social position
People outside
Money troubles


DISCUSSION

The results tend to confirm our hypotheses, though they do not provide con- 
clusive evidence in support of the fifth. Patients do show marked preferences when 
asked to rank topics of discussion for their helpfulness in group therapy. They also 
agree among themselves to a considerable extent on which topics are more, which 
less helpful. In fact, they do not confirm the dictum that ‘any topic is as useful for 
therapeutic discussion’. The statistically significant concordances of
their rankings within single groups and over the entire sample are not an artifact
caused by the inclusion of several topics which are almost unanimously ranked low.
As would be expected, discussion of problems most directly relating to manifest
maladjustments is thought to be most helpful, as is the attempt to relate these to
childhood experiences.

A high measure of group identification is evident from the finding that sex and 
marital status have little bearing on the rank order in which the topics are held to 
be useful to the group, although some of them are necessarily of more direct interest 
to one category of patients than to the other. The high correlation between in- 
dividual subjects’ group and self-focused rankings provides further confirmation of 
this conclusion. The three patients whose two rankings for helpfulness correlate 
negatively were known to be isolated in their groups; one attended irregularly, an- 
other hardly ever participated in the discussion, the third deliberately dissociated 
her problems from those of her fellow patients. None the less, the comparatively 
wide scatter of the correlation coefficients suggests that individual problems are not 
entirely or uniformly identified with group problems. It is also worth noting that 
while discussion about ‘others in the group’ and ‘quarrels’ is thought to help the 
group, these topics are not held to be particularly helpful to individual patients 
personally. On the other hand, many patients believe they have benefited by talking
about their relatives or friends outside the group, but not by their colleagues’
discussion of their relatives and friends.

The finding that the topics judged as disturbing by a patient matched closely 
the list of items he had found helpful, confirms a basic tenet of psychotherapy. It 
is also evident from the results that, although the patients do not regard all topics 
as equally helpful, they believe that the discussion of any one may be of some value, 
that none would actually hinder the progress of treatment. 

Although the overall trend is to rank topics for helpfulness in the same order 
in which they are judged to be suitable for discussion in an intimate setting only, 
the results do not provide a definite confirmation of the principle that the chief value 
of group therapy is in the opportunity it affords for bringing into the open problems 
which cannot be aired in other situations. By more closely matching the judges with 
the patients for their cultural background, perhaps, stronger evidence might have 
been furnished to support this hypothesis. It is also likely that the two types of sub- 
jects had entirely different childhood memories in mind. 


SUMMARY AND CONCLUSIONS 


By means of asking members of seven analytic psychotherapy groups to select 
items and rank them on several criteria and by analysing the data, the following 
hypotheses were tested and confirmed: 

Group patients distinguish different topics of discussion for their contribution 
to the progress of therapy, and are considerably agreed in their preferences, whether 
they belong to the same group or have had no experiences in common. 

They tend to judge the topics alike whether they have their own personal 
progress or that of the entire group in mind. Topics which cause them greater dis- 
turbance are thought to be more helpful. The discussion of no topic is held to be 
entirely useless or hindering the group in its work. 

There is strong, though not conclusive, indication that topics which cannot be 
discussed outside the intimate atmosphere of the therapeutic situation, are thought 
to be the most valuable items in clinical discussion. 


A PERSONALITY INVENTORY FOR INDUCTION SCREENING¹ ²
JACK R. DANIELSON AND JERRY H. CLARK 
U.S. Army Hospital, Fort Ord, California University of California, Santa Barbara 


INTRODUCTION 


This paper deals with the construction of a psychiatric screening questionnaire
with validation data obtained at a basic training center now in operation. Profiting
from the efforts of other workers in this field,³ the test design has considered certain
criteria:

1. A personality screening inventory should be short, readily understood, and
easily scored and interpreted. The pencil and paper, “true-false” type of test
satisfies these requirements better than perhaps the situational test or
projective test.

2. The test should be designed especially for a particular group, as pointed out
by Ellis and Conrad.

3. The test should probe psychiatric areas as broadly as possible while still
fulfilling the objectives outlined in point 1. The most widely used screening
tests, for various reasons, measure primarily neurotic tendencies without
sufficient consideration being given to other psychiatric conditions.

4. A screening test should elicit a minimum number of “false-positives”, and
the statistical evaluation of the test should provide for the manipulation of
cutting points to effect this reduction of “false-positive” scores.

5. Such a test should be validated upon an external criterion, even though
some authorities feel that a validation based upon preinduction psychiatric
appraisal is adequate. For example, Stouffer et al. state: “If the diagnostic
criteria implicit in the test were not close to those ordinarily employed by
psychiatrists, then they (the psychiatrists) would not in fact find a disproportionate
number of psychoneurotics in the screened sub-group and would, therefore,
have no confidence in the test.” On the other hand, it has been pointed out


¹The authors wish to express their appreciation to Dr. H. G. Gough who participated actively in
the early planning of this project, in addition to making a number of his own scales available.
The authors are indebted to the following: Dr. W. D. Altus, who contributed one of the scales,
and Dr. Starke R. Hathaway, who permitted use of the MMPI items in the present study.

²The opinions in this article are those of the authors and are not to be construed as reflecting the
views of the United States Army.

³An extensive bibliography has been compiled by Ellis and Conrad and will not be duplicated
in this article.
that screening tests usually correlate highly with psychiatric judgments but
do not correlate highly with performance criteria. In the case of the test to be 
presented herein, an effort was made to employ familiar diagnostic criteria 
which would have the added virtue of being validated objectively. 


It is desirable to predict combat effectiveness, that is, to use combat performance
as the validating criterion; however, in practice this would be extremely difficult
and perhaps impossible. Menninger, for instance, believes that such prediction
is not possible because of such variable factors as morale, motivation, leadership and
degree of stress which may delay, hasten or prevent an emotional breakdown. Since
basic training is the first stress situation encountered in the Army and some individuals
do not survive it, prior elimination of these men would be of great value. Basic
training also offers a reasonably constant environment and the necessary facilities
for a validation study based upon performance. For these reasons the completion
of basic training has been used as the criterion.


THE FORT ORD INVENTORY


The Fort Ord Inventory was designed to be given at induction stations to detect 
those individuals who manifest or are likely to manifest psychiatric disability of 
sufficient severity to preclude Army service. Since the final decision as to disposition 
is made by the psychiatrist, the test should be classified as a screening adjunct. 


The Test. Our original intention was to start with a large number of items and 
construct the desired scales by checking each item empirically against the chosen 
criterion. However, because of practical limitations existing in the basic training 
situation this was not possible. Instead, previously validated scales were used and 
these were then shortened and revalidated. The first version of the Inventory (Form 
I) consisted of 187 items divided into 5 scales and 10 “stop” items. One scale which
proved to be ineffective in this particular setting was discarded and another scale,
measuring psychotic tendencies, was added. The final form of the test consists of
100 items divided into 5 scales and 4 “stop” items. The Psychotic Scale will not be 
reported here. The test length was reduced by means of several item analyses based 
upon both internal validity and external criteria. Validating data are available for 
the following four scales: 


1. Delinquency, measures delinquent tendencies, 20 items.
2. Neurotic, measures neurotic tendencies, 20 items.
3. Fake Bad, measures malingering tendencies, 20 items.
4. Femininity, measures feminine tendencies, 16 items.


The stop items deal with homosexuality, nervous breakdown, epilepsy, and bed- 
wetting. An affirmative answer to any of these items would result in further exam- 
ination of the testee. 


Method. More than 15,000 recruits were given the Fort Ord Inventory within 
one or two days after their induction. The testees were allowed to begin and con- 
tinue basic training regardless of their test scores. Over a period of time enough of 
these men became psychiatric or disciplinary problems to constitute validation 
groups. Because only a relatively small percentage of soldiers become psychiatric 
referrals or go AWOL during basic training, it was necessary to test large numbers of 
men to achieve adequate validation samples. These men had previously been screen- 
ed; therefore, the men more readily detected had already been eliminated. For this 
reason, the task demanded of the Inventory was more difficult than that faced by 
similar tests used exclusively at the induction level. 

For purposes of evaluation of the test, three psychiatric groups and one dis- 
ciplinary problem group were compared with what has been termed a good Army 
adjustment group, and also with what might be called an excellent Army adjustment 
group. The good Army adjustment group consisted of men from two representative 
training companies, tested before beginning basic training, who then successfully
completed basic training. The excellent Army adjustment group consisted of men 
from several representative Leaders classes who were tested prior to basic training 
and who subsequently completed both basic and the Leaders course. It should be 
emphasized that all individuals described were tested prior to basic training. The 
poor Army adjustment groups were: 


1. S-4, N=89, so severely emotionally disturbed as to warrant a recommendation
for discharge by the Division psychiatrist.

2. S-3, N=53, moderately disturbed, not recommended for discharge but
returned to duty.

3. S-2, N=37, mildly emotionally disturbed and also sent back to duty.

4. AWOL Recidivists, N=38, AWOL two or more times during basic
training.

The groups used for standards of good and excellent Army adjustment were:

1. S-1, N=415, completed basic training, never in stockade or referred
for psychiatric reasons during basic training.

2. Leaders, N=87, in the upper third of class in Leadership School, an
advanced course, qualification for which is determined by basic training
performance.


VALIDATION RESULTS 


As noted above, the validation procedure consisted of comparing the good and
superior adjustment groups with the poor adjustment groups. In Table 1 appear the
mean and sigma on four scales of the Fort Ord Inventory for these groupings. With
the S-1 group as a base, t’s were computed on the differences between the means for
all groups; the levels of confidence of these t’s also appear in Table 1. Of the 20 t’s,
all but 6 are at the 1 per cent level of confidence, and only 3 do not attain the 5 per
cent level of confidence. The Leaders scored consistently lower on all four scales (a
high score on any scale indicates maladjustment) and all poor adjustment groups
scored consistently higher on all scales with the exception of the AWOL group on
the femininity scale. It is to be noted that all the scales differentiate the S-3 and S-4
groups at the 1 per cent level and that the t’s for the S-4 group range from 3.6 to 6.5;
by comparison, the S-2 group are not as readily discriminated. This is to be expected,
as by definition the S-2 group is composed of individuals with only minor emotional
impairments.
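
The comparison of each group mean with the S-1 mean can be sketched from summary statistics alone. This is a minimal illustration, assuming the t is the difference between means divided by the unpooled standard error of that difference; the source does not state which formula was used. The group sizes are those reported for the AWOL recidivist and S-1 groups, while the means and sigmas are hypothetical placeholders and are not values from Table 1.

```python
import math


def t_from_summary(mean_a, sd_a, n_a, mean_b, sd_b, n_b):
    """t for the difference between two independent group means (unpooled standard error)."""
    standard_error = math.sqrt(sd_a ** 2 / n_a + sd_b ** 2 / n_b)
    return (mean_a - mean_b) / standard_error


# Group sizes come from the text (AWOL recidivists N=38, S-1 N=415).
# The means and sigmas are HYPOTHETICAL, chosen only to show the arithmetic.
t = t_from_summary(mean_a=9.0, sd_a=3.0, n_a=38, mean_b=6.0, sd_b=3.0, n_b=415)
print(round(t, 2))
```

With groups as unequal in size as these, the standard error is dominated by the smaller group, which is why large numbers of recruits had to be tested to obtain adequate validation samples.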

The most important task of a test designed to screen out misfits is the detection
of the S-4 group (unfit for Army service by reason of severe emotional impairment).
The effectiveness of the Inventory in this respect can be seen by referring
to Table 1. Of next importance is the detection of the AWOL recidivists, who
comprise the majority of the seriously delinquent in the Army. For this task the
Delinquency Scale was particularly effective; a t of 5.3 was obtained when the S-1
and the AWOL group were compared.

As further evidence of the test’s validity, the Leaders Course graduates score 
lower than the S-1 group (significant at the 1 per cent level of confidence) on the 
Neurotic, Fake Bad, and Femininity Scales, and lower than all other groups on all 
scales at the 1 per cent level. Although intelligence is probably a contributing factor 
in these differences, since the Leaders group as a whole are more intelligent than 
other groups, it is unlikely that intelligence alone can account for the obtained differ- 
ences. The Delinquent, Neurotic, and Fake Bad differences between the Leaders 
group and the other groups are readily explained. The significantly lower score on 
the Femininity Scale obtained by the Leaders group may be explained by an ex- 
amination of the Femininity items. As compared with all other criterion groups, the 
Leaders group are more aggressive, more likely to prefer masculine occupations and 
reject feminine-like occupations, and more oriented toward achievement. 
Two points should be emphasized in connection with the scores for the AWOL
recidivist group. First, this was a highly select group in terms of delinquency, tested
before entrance into basic training (as were all validating groups), and thus before
going AWOL. Because of these facts, the differences between this group and the
S-1 group, particularly the t of 5.3 on the Delinquency Scale, attain greater significance.
Another validation group (N=99), a prisoner group tested after starting
basic training, scored higher than the S-1 group on all scales, significant at the 1 per
cent level of confidence. The mean for this prisoner group was 9.90 on the Delinquency
Scale, compared to the mean of 8.84 for the AWOL recidivist group. This
difference in mean scores is probably due in part to the effects of incarceration. In
any case, the t’s on the Delinquency Scale for each of the groups are of such magnitude
as to leave no doubt concerning the validity of the scale.

Second, the high Neurotic score for the AWOL group is in accord with a previous
study using a similar delinquent group, which indicated that the existence of
neurotic trends in individuals diagnosed as “psychopaths” or severe delinquents is
not uncommon.

An examination of the percentile scores on each scale for the S-1, S-4, AWOL
recidivist, and Leaders groups revealed that the most effective scale, the scale which 
discriminated the best, was the Fake Bad Scale. For example, if a cutting score of 
six were used on this scale, 55 per cent of the S-4 group would be picked up while 
picking up 19 per cent of the good adjustment group. The Fake Bad Scale was 
originally designed to detect malingerers. As it is improbable that a large percentage 
of the men recommended for discharge by the Mental Hygiene Consultation Service 
were malingerers, particularly in view of the intensive psychiatric evaluation of these 
men, it seems apparent that this scale was actually measuring some other tendency. 
Apparently, in an Army setting, rather than measuring malingering, it measured 
global maladjustment and poor motivation of a degree sufficient to result in dis- 
charge. 

By using various cutting points on the scales, plus “stop” items, rather than a
cutting point on just one scale, it was possible to detect 63 per cent of the S-4, 45 per
cent of the S-3, and 51 per cent of the S-2 groups, while picking up approximately 15
per cent of the S-1 group who can be termed “false positives”. An additional 5 per
cent were picked up who were illiterate, semi-illiterate, or mental defective, but these
cannot be considered “false positives” as they do need further psychological evaluation.
By use of the same cutting points, 68 per cent of the AWOL group were detected
as compared with 15 per cent of the S-1 group. The cutting points used for the
various scales of the Inventory were established by considering different combinations
of cutting points and choosing those which selected the greatest number of
maladjusted individuals while, at the same time, selecting the fewest normal individuals.
The cutting points were, of course, arbitrary and may be raised or lowered
depending upon the particular needs of the situation. When the test is used with
different groups, statistical methods should be followed which will make it possible
to adjust cutting points to the particular sample. “Stop” items without any other
significant scale were responsible for picking up 7 per cent of the S-4 group. Twenty-two
per cent of the S-4 group would be picked up by the use of “stop” items alone,
disregarding scale scores. It is thus seen that “stop” items are helpful but are not
absolutely necessary, depending upon the current attitude toward the specific abnormalities
covered by them. Enuretics and epileptics do not seem to be detected
in any other manner.
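
The trade-off involved in setting a cutting point on a single scale can be sketched as follows. This is a simplified illustration under stated assumptions: it treats one scale only, whereas the Inventory combined several scales and “stop” items, and it chooses the cut with the highest hit rate among those whose false-positive rate does not exceed a ceiling. The function names, the 20 per cent ceiling, and the scores are all hypothetical and are not data from the study.

```python
def rates_at_cut(poor_scores, good_scores, cut):
    """Proportion of each group scoring at or above the cutting score."""
    hit_rate = sum(s >= cut for s in poor_scores) / len(poor_scores)
    false_positive_rate = sum(s >= cut for s in good_scores) / len(good_scores)
    return hit_rate, false_positive_rate


def best_cut(poor_scores, good_scores, max_false_positive=0.20):
    """Cut with the highest hit rate whose false-positive rate is at or under the ceiling.

    Ties are broken by the lower false-positive rate, then by the higher cut.
    Returns None when no cut satisfies the ceiling.
    """
    all_scores = poor_scores + good_scores
    candidates = []
    for cut in range(min(all_scores), max(all_scores) + 1):
        hit, fp = rates_at_cut(poor_scores, good_scores, cut)
        if fp <= max_false_positive:
            candidates.append((hit, -fp, cut))
    if not candidates:
        return None
    hit, neg_fp, cut = max(candidates)
    return cut, hit, -neg_fp


# HYPOTHETICAL scale scores (a high score indicates maladjustment).
poor = [2, 4, 6, 6, 7, 8, 9, 10, 11, 12]
good = [0, 1, 1, 2, 2, 3, 3, 4, 5, 7]
cut, hit, fp = best_cut(poor, good)
print(cut, hit, fp)
```

Raising or lowering `max_false_positive` corresponds to raising or lowering the cutting points according to the needs of the situation, as the text describes.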

Table 2 gives a breakdown of psychiatric diagnoses for the S-2, S-3, and S-4
groups. Of those recommended for discharge, including all categories of diagnoses,
63 per cent were detected by the test. In this S-4 group, those diagnosed as Neurotic
and as Pathological Personality types were best detected, while Psychotics and Immaturity
Reactions were less well detected. In the S-2 and S-3 groups, again the test
did not detect the Immaturity Reactions as well as some of the other diagnostic
categories. The addition of the Psychotic Scale should increase the efficiency of the
test in this area. In the case of the Immaturity Reactions, no scale was constructed
with the aim of detecting this particular category. We know of no scale which is
oriented specifically to this syndrome, which is not as well defined as other psychiatric
classifications; in other words, frequently a patient might be classified as an Immaturity
Reaction who could also be classified as a Pathological Personality type.
Considering these facts, it may be worth while in future research to construct a scale
for the purpose of detecting this Immaturity Reaction group.


TABLE 2. FORT ORD INVENTORY II: PSYCHIATRIC CASES FALLING WITHIN DIAGNOSTIC CATEGORIES
DETECTED (D) AND UNDETECTED (U)

                                   S-4          S-3          S-2
Categories                        D    U       D    U       D    U

I. Psychoses*
II. Psychoneurotic Disorders
III. Character and Behavior Disorders
   A. Pathological Personality Types, i.e., Schizoid, Paranoid, Antisocial
      Personalities, etc.
   B. Immaturity Reaction Types, i.e., Emotional Instability, Passive
      Dependency, Enuresis, etc.
   C. Alcoholism
   D. Epilepsy 2 0

Totals (I, II, III)              56   33      24   29      19   18

*It is expected that the addition of the Psychotic Scale will improve the test’s efficiency in this
category.
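
The detection percentages quoted in the text can be checked against the totals row of Table 2. The sketch below assumes that each pair of totals reads detected first and undetected second; this ordering reproduces both the group sizes given earlier (89, 53, 37) and the reported figures of 63, 45 and 51 per cent. The function and variable names are illustrative.

```python
# (detected, undetected) totals from Table 2, assuming each pair reads D then U.
totals = {"S-4": (56, 33), "S-3": (24, 29), "S-2": (19, 18)}


def detection_rate(detected, undetected):
    """Percentage of a group picked up by the test."""
    return 100 * detected / (detected + undetected)


group_sizes = {group: d + u for group, (d, u) in totals.items()}
rates = {group: round(detection_rate(d, u)) for group, (d, u) in totals.items()}
print(group_sizes)
print(rates)
```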

















DISCUSSION 


In no main category in the case of the S-4 group did the test detect less
than 50 per cent. One might ask whether this percentage of “hits” could be improved
by some method such as lowering the critical scores. It was found in examining the 
scale scores that manipulation of cutting points would not substantially improve the 
effectiveness of the test. It seems that the test either clearly detects individuals or 
does not detect them; there are few borderline misses. From the data presented in 
Table 2 it is evident that there is no single psychiatric classification that is being 
overlooked; i.e., it can be seen that the misses that did occur are spread over all the 
diagnostic categories rather than being limited to just one or two categories. 

In order to account for those individuals who were undetected by the test, it
may be hypothesized that some inductees were “faking good” at the time of the test
(this is certainly true of many epileptics and enuretics), in which case further stress
on truthful replies might be effective. It may also be that some inductees were
undetected because they deny emotional conflicts. It has been the attitude of the
authors that those of borderline adjustment who were “faking good” were sufficiently
well motivated that they should be given a chance to succeed in the Army. As far
as the misses are concerned, another possibility is that some of these men may have 
led such a protected life prior to induction that they may have little recognition of 
their inadequacies until overwhelmed by the unusual stress of basic training. 

The authors believe that the Fort Ord Inventory meets the requirements for a 
screening instrument outlined in the introduction. It is short, readily understood, 
quickly scored, and easily interpreted. It screens for the major types of emotional 
abnormalities. It has been demonstrated that it is effective in its application to the 
group for which it was designed. Finally, the Fort Ord Inventory is performance 
validated, using a criterion which is tangible and objective. 

SUMMARY


Prerequisites for a psychiatric screening test to be used in the examination of 
large numbers of prospective inductees at induction stations were described. A 
specific screening test, the Fort Ord Inventory, for selecting maladjusted individuals 
for psychiatric interview was presented. The test consisted of scales measuring 
neuroticism, delinquency, femininity, army maladjustment, and psychotic tenden- 
cies. Validating data presented for four of these scales indicate the overall effective- 
ness of the test as a screening adjunct. 


A CRITICAL REVIEW OF H-T-P VALIDATION STUDIES*
WILLIAM SLOAN 
Lincoln State School 


This paper reviews in a critical way some of the validation studies which have 
been done on the H-T-P, but it does not evaluate the H-T-P or projective techniques 
in general. We will present a brief statement of the nature of validity, inspect some 
of the studies which have appeared, and conclude by pointing out some of the ways 
in which such studies may be improved. 

Any discussion of validity must recognize that there is no such thing as “validity”
without some qualification. No test or technique is valid except in the sense
that it is valid for some purpose. A given test may be valid for several different
purposes and the validity in each instance may differ. Thus, we speak of different
“validities” and should specify the kind of validity which we have in mind. The
concept of validity itself consists of two aspects. One of these is reliability or the 
consistency with which the test measures whatever it does measure. The second 
aspect of validity may be termed relevance, the correlation between the test and a 
criterion. A criterion is the function that the test attempts to measure, and is one 


*Based on a paper presented as part of a Symposium on “An Appraisal of the H-T-P Technique 
as a Clinical Instrument” at the Annual Meeting of the American Psychological Association, Cleve- 
land, Ohio, September 6, 1953. 





standard against which the test is evaluated. In clinical psychology our criteria
usually cannot be known directly. For the most part they consist of highly abstract
conceptualizations such as “anxiety”, “intelligence”, “affectivity”, etc. If we were
to pretend to measure these things directly we would be in a dilemma, since each
of these concepts in itself consists of a number of homogeneous facts or internal
conditions. We would have to know each of these separate conditions before we could
have some valid knowledge of the “trait”. As Cureton has put it, the useful ultimate
criterion must imply unambiguously or specify explicitly an area of behavior
which is operationally scorable. Since the internal conditions which we are postulating
are operationally undefinable, they are, therefore, intrinsically invalid symbols.


If we were to stop here clinical psychology would just have to discontinue at- 
tempts to measure personality traits and would have to limit itself to measurement 
of pure, single, homogeneous characteristics. This, of course, is not feasible. To 
evaluate personality it is necessary to postulate the existence of hypothetical var- 
iables and then proceed to define them operationally in terms of observed behavior. 
In doing this we must set up rigorous standards for observation and measurement 
of behavior. Qualitative and subjective judgments have dubious acceptability in 
this frame of reference. It is important for our purpose to specify the materials, 
actions, situations, and scoring criteria implied by the abstract terms which denote 
our hypothetical constructs. It is because of the inferential nature of the experi- 
mentation which is necessary in this field that we must be explicit in stating our 
hypotheses, the nature of our validating criteria, the hypothetical propositions 
upon which these are based, and the inferences which are made in the logical process 
which is part of the validation study. In the light of these general principles we shall 
attempt to evaluate a few of the H-T-P validation studies within this frame of 
reference. 

Probably the most systematic attempt to validate some of the hypotheses of 
the H-T-P has been made by Jolles. In a series of four studies he has investigated 
the hypotheses concerning sexual identification, the phallic tree as an indicator of 
psychosexual conflict, horizontal placement, and vertical placement of the drawings 
on the page. In the first of these studies he examines the observation, suggested
by Buck, that subjects who draw the H-T-P tend to draw individuals of their own
sex. He has limited his study to grade school age children. His findings, in general,
serve to support the observation. This is interesting information, but it has little
or no bearing on the validity of the H-T-P, since this finding is not related to an
independent criterion. No actual examination of a hypothesis has been made; the
study simply supports an observation.

Going just this far, there is no objection to the procedure. However, he has gone 
beyond this and found that there were some differences in the ratios for children of 
different ages. For example, eleven and twelve year old girls tend to draw more 
figures of the opposite sex than the younger girls and also than boys. He then sug- 
gests the relationship between this tendency and the phenomenon of adolescents 
wearing masculine clothing, and suggests that this might give us clues regarding 
differences in psychosexual development of girls and boys. There is an implication 
here of some inferences which certainly should be spelled out concerning the relation- 
ship between psychosexual development, sexual identification, and the preponder- 
ance of drawings of the same sex. It is a moot question whether or not the subject 
identifies with the person that he has drawn, since not all projective psychologists 
subscribe to this notion. Levy (p. 7) states, “I have concluded that a drawing may
be a projection of self-concept, a projection of attitudes toward someone else in the 
environment, a projection of ideal self-image, a result of external circumstance, an 
expression of habit patterns, an expression of emotional tone, a projection of the 
subject’s attitudes toward the examiner and the situation, an expression of his attitudes
toward life and society in general”. Jolles presents no evidence to justify his
second conclusion that the sex of the drawn person tends to represent the felt sex 
role of the subject. 

It is curious to find some statement concerning the need for caution in interpreting
homosexual trends in a six-year old boy who drew a female person. Here we
have an implied inference, without much evidence, that the sex of the drawn person 
may be related to homosexuality. A weakness in this type of study is the failure to 
make explicit the inferences and to point out the steps which need to be taken in 
going from a bit of observed behavior, such as the sex of the drawn person, to a 
hypothetical construct such as psychosexual development. Jolles has succeeded in 
demonstrating that children do tend to draw figures of the same sex, but any infer- 
ences that go beyond this cannot be accepted on the basis of these data because no 
logical relationship has been demonstrated. 

In the second of this series of studies Jolles has investigated the phallic tree
as an indicator of psychosexual conflict. Here he finds, not too objectively perhaps,
the characteristics of what he calls “the phallic tree”. He mentions that it has occurred
with considerable frequency in the H-T-P’s of mental defectives. Aside from
this statement concerning mental defectives in the opening sentence of the study, 
there is no further mention of this clinical group. The point of departure for this 
study is that “the occurrence of this type of tree in a normal adult might very well 
be interpreted the same as a very large chimney, i.e. overconcern about sexual matters
with a need to demonstrate virility, or as indicating exhibitionistic tendencies”
(6, p. 46). The source of this statement is given as Buck. The reasoning is not too
clear here. If this is a study to investigate Buck’s statement as a hypothesis, then 
one should not start the study by postulating that a so-called “phallic tree” indicates 
an overconcern about sexual matters. He seems to accept and take for granted that
such an interpretation is correct and now is concerned with determining whether 
or not this interpretation applies to young children. What is the validity of this in- 
ference if it should turn out that Buck’s hypothesis is not correct? His procedure in 
this study is to use the technique of “internal consistency”. This is done by pointing
out certain characteristics of the chimney and person without relating them to some 
outside criterion. It would be just as reasonable to suppose that none of these things 
have any relationship to psychosexual disturbance, but since this is a closed system 
there is no way to find any evidence for this hypothesis one way or the other. 

It is desirable to have evidence concerning the reliability of his scoring of the 
chimney, the tree, and the person for the presence of “sexuality”. If another investigator
evaluated the chimney, the tree, and the person would he find the same incidence
of “sexuality” in these drawings?
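
The question of scorer agreement raised here can be put in numerical form. The following is a minimal sketch and not a procedure used in the studies under review. It assumes that two judges have independently scored the same drawings for the presence (1) or absence (0) of “sexuality”, and it computes Cohen's kappa, a chance-corrected index of agreement introduced later than these studies. The ratings are hypothetical and serve only to illustrate the arithmetic.

```python
def cohen_kappa(rater_a, rater_b):
    """Chance-corrected agreement between two raters scoring the same items."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("both raters must score the same non-empty set of items")
    n = len(rater_a)
    categories = set(rater_a) | set(rater_b)

    # Proportion of items on which the two raters gave the same score.
    observed = sum(a == b for a, b in zip(rater_a, rater_b)) / n

    # Agreement expected by chance from each rater's own marginal rates.
    expected = sum(
        (rater_a.count(c) / n) * (rater_b.count(c) / n) for c in categories
    )

    if expected == 1.0:
        # Both raters used a single identical category throughout.
        return 1.0
    return (observed - expected) / (1.0 - expected)


# Hypothetical scores for ten drawings (1 = "sexuality" judged present).
judge_1 = [1, 1, 1, 0, 0, 0, 1, 0, 1, 0]
judge_2 = [1, 1, 0, 0, 0, 1, 1, 0, 0, 0]

kappa = cohen_kappa(judge_1, judge_2)
print(f"kappa = {kappa:.2f}")
```

In this hypothetical case the judges agree on seven drawings out of ten, yet half of that agreement would be expected by chance alone, so the corrected index is considerably lower than the raw percentage suggests.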

One can speculate at this point whether this study is concerned with the valida- 
tion of the H-T-P or with some inferences concerning psychosexual development. 
The core of this problem seems to be the need for some experimental evidence con- 
cerning the theory of psychosexual development. As yet it is not too clear on the 
basis of available experimental evidence that psychosexual development occurs in 
exactly the way in which psychoanalytic theory postulates. The tying-in of the 
drawings to this theory is a tenuous procedure if the theory of psychosexual develop- 
ment is assumed to be the validating criterion. 

In the third study by Jolles and Beck, “horizontal placement of the drawing”
is investigated as it applies to the concept of the psychological center of the page. 
Buck’s hypothesis is that the horizontal placement of the drawing is related to time 
in the life of the individual who makes the drawing. The left side of the page pertains 
to the past and the right side of the page to the future. Another interpretation ap- 
plied to horizontal placement is that the farther the drawing appears to the left of 
the psychological center, the more the subject has a tendency toward immediate 
emotional expression, while the farther to the right of the psychological center the 
greater the intellectual control. The first finding in the study is that the psycho- 
logical center of the page is indeed to the left of the geometric center. These differ- 
ences, however, range from a tenth of an inch to a third of an inch. In each case the 
standard deviation for these means is more than half an inch. Certainly one would 
have to agree with Jolles and Beck in this case that it would be difficult to make any 
interpretation from the variation for a single subject. 
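
The size of these differences relative to their scatter can be illustrated with a short calculation. The sketch below assumes, purely for illustration, that placements are normally distributed; the study as summarized here does not report the form of the distribution. It takes the largest mean displacement cited (a third of an inch) together with the smallest standard deviation consistent with the report (half an inch) and estimates what fraction of individual drawings would still fall on the opposite side of the geometric center.

```python
import math


def normal_cdf(z):
    """Cumulative probability of the standard normal distribution at z."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def fraction_on_opposite_side(mean_shift, sd):
    """Share of individual placements falling on the far side of the
    geometric center, given the mean shift and standard deviation
    (both in inches) and ASSUMING normally distributed placements."""
    return normal_cdf(-mean_shift / sd)


# Most favorable case for the hypothesis: largest shift, smallest spread.
best_case = fraction_on_opposite_side(mean_shift=1.0 / 3.0, sd=0.5)
# Least favorable cited shift (a tenth of an inch) with the same spread.
worst_case = fraction_on_opposite_side(mean_shift=0.1, sd=0.5)

print(f"best case:  {best_case:.2f}")
print(f"worst case: {worst_case:.2f}")
```

Even in the most favorable case, roughly a quarter of the subjects would be expected to place the drawing on the "wrong" side of the geometric center, which is consistent with the authors' caution against interpreting the placement of a single subject.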

Further in this study they find “meaningful” relationships between horizontal
placement and the development of intellectual control. The assumption here is that 
intellectual control increases directly with age. They find such a relationship for 
the house, none for the tree and an inverse relationship for the person. Jolles and 
Beck then go on to interpret their findings in terms of the different psychological 
meanings which the house, the tree and the person have. They should have stated 
these hypotheses early in the study since the justification for the findings after they 
have been obtained is not sound scientific procedure. One gets the feeling that if the 
opposite results had been obtained they also could have been “explained” in terms 
of some facet of Buck’s theory. 

One must take issue with the final conclusion of this study. It states, “According
to the criteria suggested in this study, Buck’s interpretation of the stimulus
value of the three wholes seem to apply to the subjects of this study”. It would appear
that the objective criterion for this statement rests upon such statements as
“the older the child becomes, the more readily he lends himself to the regulations
set for him by the parents”, and again “the basic reaction pattern of subjects does
not change with age to a significant degree beyond the age of five” and, finally,
“the older a subject the more social he becomes.” This may be considered an example
of reasoning by analogy. One could almost set up alternative hypotheses and
proceed to justify them with as much reasoning as has been done with the hypotheses
presented.

In the fourth study in this series the validity of the hypothesis concerning
vertical placement was examined. The hypothesis suggested that placement of the
drawing above the psychological center of the page indicates an attempt to gain
satisfaction from fantasy and/or over-striving. Also placement of the center of the
drawing below the psychological center of the page suggests a tendency to be reality
bound, a tendency to feel insecure, and/or a depressive mood tone. The statement
which follows seems to be the core of the logic of validation in this study. “On the 
basis of clinical experience with the Rorschach technique, ability to seek satisfaction 
in fantasy increases to a certain extent with age, i.e., older children may be expected 
to produce more M than younger children. Applying this to Buck’s interpretation 
of vertical placement, we would expect a positive correlation of the height of the 
drawings on form page with age.” It is not clear why the M on the Rorschach is
used as the criterion in the present study. For example, we might suggest that on 
the basis of clinical experience with the Stanford-Binet, mental age increases with 
age. Consequently, if we find higher placement of drawings on the page with an 
increase of age we might logically suggest that placement of the drawing on the page 
is directly proportional to mental age. The same kind of reasoning may be used with 
a number of other factors of a psychological or even physical nature which increase 
with age. Just why it would have to be fantasy is not too clear. 

Further in this study, differences in the placement for the three drawings are 
explained in much the same manner as was done in the previous study. In the final 
statement of the study they point out that the mean vertical placement is higher 
for girls than boys on all three drawings and then suggest that this means a greater 
tendency for girls to seek satisfaction in fantasy than boys, or that there is greater 
striving among girls than boys. Here we have the error of the converse in logic. The 
original purpose of the study was to determine whether or not the placement of the 
drawing is related to fantasy and/or striving. Now we find that the reverse takes 
place; namely, that the placement of the drawings indicates greater tendencies for
fantasy in girls than boys, or greater striving among girls than boys. There is no 
independent evaluation of fantasy or striving in any of the subjects outside of the 
drawings themselves. We lack a criterion in this study and, consequently, no conclusions
concerning validity should be made.

In summarizing these studies one is impressed with the tremendous amount 
of work which has gone into them. The numbers of subjects used are quite adequate 
and appear to be a fairly good sample of grade school children. The amount of data 
derived is considerable and can be evaluated in a number of ways. It is difficult,
however, to get a clear picture of the logic which has been followed in these studies.
There is some confusion as to whether or not the studies attempt to verify the hy- 
potheses of the H-T-P directly, or to verify Buck’s hypotheses on adults as to their 
applicability to children, or to verify some hypotheses concerning psychosexual de- 
velopment independent of Buck’s hypotheses. We find illustrations of all three of 
these scattered through these studies. 

It may be suggested at this point that it may be well to discard any or all pre- 
conceived notions concerning Buck’s hypotheses, or the psychoanalytic hypotheses 
concerning psychosexual development, and simply to use the data of this study to 
set up some kinds of developmental norms. These norms might be in terms of the 
prevalence of various characteristics of the drawings at various age levels and for 
different sexes. Ideally one would hope to have some independent criteria of personality
traits available on each subject. This is not feasible from a practical point of
view; consequently, the attaching of peculiarities of personality traits to any peculiarities
of drawings is not logically feasible. What could be done is to attempt to
find clusters or groups of drawing characteristics which appear to be related, within 
the drawings of each individual. Just what meanings a cluster would have would 
still remain a problem for further research and should be based on independent 
validation using some estimate of personality traits derived from some independent 
source. However, the presence of such clusters of drawing characteristics might 
provide some logical leads to the investigation of personality characteristics. In a 
small way this is what Jolles has done in the study on the phallic tree. Here he has 
used three symbols as a cluster, namely, the chimney, the phallic tree, and the 
presence of sexual symbols in the drawing of the person. The weakness in his study 
is that he started out with the pre-conceived notion that these drawing character- 
istics represented psychosexual disturbance. This conclusion should follow only 
after a great deal of evidence has been accumulated. 


Still another type of error is illustrated by the report given by Deabler. This
paper is replete with such statements as “tend to give”, “sometimes also found”,
“seldom try to produce”, and “are frequently drawn”, ad infinitum. To take a
specific illustration, “Epileptic patients are often found to draw their persons with 
slanting posture, this evidently reflecting the experience they have of falling over.” 
What we need to know here is (a) how many epileptics do not draw their persons with 
slanting posture, as well as (b) how many do. In addition we should know (c) how 
many in a group of “normals” draw their persons with a slanting posture and (d)
how many do not. Here we have the elements of a four-cell contingency table.
Finally, even if a significant ratio of epileptics do conform to the hypothesis, what is
the rationale for the conclusion “evidently reflecting the experience they have of
falling over”?
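
The four counts (a) through (d) are all that is needed to test whether slanting posture is associated with the diagnosis at all. The following sketch shows the arithmetic of the chi-square test for a four-cell table, without continuity correction. The counts are hypothetical, since no such figures are reported, and the function name is ours.

```python
def chi_square_2x2(a, b, c, d):
    """Chi-square statistic (1 degree of freedom, no continuity correction)
    for the four-cell table

                      slanting   not slanting
        epileptics        a           b
        normals           c           d
    """
    n = a + b + c + d
    row_1, row_2 = a + b, c + d
    col_1, col_2 = a + c, b + d
    if 0 in (row_1, row_2, col_1, col_2):
        raise ValueError("every row and column must contain at least one case")
    return n * (a * d - b * c) ** 2 / (row_1 * row_2 * col_1 * col_2)


# Hypothetical counts: 12 of 30 epileptics and 10 of 30 normals
# draw the person with a slanting posture.
chi2 = chi_square_2x2(a=12, b=18, c=10, d=20)

# 3.841 is the conventional .05 critical value for one degree of freedom.
CRITICAL_05 = 3.841
print(f"chi-square = {chi2:.3f}, significant at .05: {chi2 > CRITICAL_05}")
```

With these illustrative counts the statement “epileptic patients are often found to draw their persons with slanting posture” would be literally true, since two in five do so, and yet the table gives no evidence that they do so more often than normal subjects.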

There is much similarity between the review which Cronbach makes of
Beck’s book and some of the type of reasoning which has characterized the validation
studies of the H-T-P. We might paraphrase Cronbach’s statement and substitute
H-T-P for Rorschach and Buck for Beck. We would then have, “Before the
H-T-P can hold an established position among those who are deficient in the ‘wish 
to believe’ the interpretative statements such as Buck presents must be accompanied 
by explicit evidence. Such evidence would not say (to quote at random) ‘horizontal 
placement as an indicator of emotional expression and intellectual control applies 
to the subjects of this study’ but would say ‘x persons out of one hundred tend to 
draw the figure to the left of the mean of the page’. Here is the crucial problem. 
Buck (Jolles, Deabler) and others word statements as if generally true, although the 
context makes clear that the interpretations are not true in one hundred cases out 
of a hundred. Hence, the propositions are not to be accepted as stated. This ap- 
proach to the H-T-P does not leave us with testable claims. The H-T-P enthusiast 
can say that the truth is contained only in his many qualifications. But this too 
easily becomes an evasion of responsibility.” 

We see, of course, that the H-T-P has many problems. However, they do not
appear to be any more or any less numerous than those of most of the projective
techniques now in vogue; they are not at all unique but are a special case of the
generalized problems of the whole field of projective psychology.

In summarizing, we see that the validation studies of the H-T-P are characterized
by lack of clear, logical statements concerning the concept of validation. Inferences
are made and are more frequently implicit than explicit. These characteristics
in some part may be due to the early stages of the development of these techniques.
There does not seem to be any good reason why systematic programs of
research which will answer these questions cannot be set up.
If hypotheses are stated in testable form and if the steps necessary for logical de- 
velopment of validation are followed, in due time there can be developed a fairly 
sound clinical approach to the evaluation of personality using some of the methods 
presently available. However, clinicians should become aware of the fact that bland 
statements to the effect that “it works” no longer will be accepted as satisfactory
grounds for scientific acceptance of a technique. Hutt defends the clinician who feels that
using a certain technique in this way does provide him with usable clinical results, 
but he should recognize then that he is functioning as an artist and not as a scientist. 
The purpose of validation is to supply the clinician with information which will 
enable him to evaluate his clinical results. The transition of clinical psychology 
from an art to a science will be made only as clinicians become more sensitive to the 
need for sound validation studies. 

CLASSIFICATION AND TREATMENT PROBLEMS IN A CASE OF
ENCEPHALOPATHY 


BESSIE PAZEIAN 


Lyman School for Boys, Westborough, Mass. 


INTRODUCTION 


Increased interest in the study of encephalitic children who perform like mental 
defectives has initiated a new area of scientific investigation. Where these children 
exhibit behavior disturbances so severe that they become known to the courts, the 
problem assumes a dual aspect, that of disposition in addition to classification. That 
these children are frequently mistaken for mental defectives because of test scores 
is one aspect of the present case study. The other, and certainly the more vital con- 
cern, is the unusual therapeutic problem which they present and the lack of suitable 
facilities for treating them. 

Henderson and Gillespie note that the chronic course of this disease creates
special problems in administration partly because there has been a natural reluct- 
ance to place such children in mental hospitals and partly because the resulting symp- 
toms have been of a type peculiarly difficult to manage. The conduct disorders 
characteristically seen in these children and adolescents create social as well as 
medicolegal problems. 


It has been established that cases of organic brain damage reveal memory as 
well as perceptual impairment in mental function, but it has long been a matter
of conjecture whether a test of general intelligence such as the Stanford-Binet can 
discriminate between the performance of a child with organic brain disease and that 
of a normal child. Klebanoff states, “The Stanford-Binet and other tests of general
intelligence are heavily weighted with reference to test items of memory function.
Yet, it is surprising to find the relatively limited number of investigations of
a systematic nature of memory ability in cases with cortical damage.” 

The second deficit, that of visual motor function, is treated by Bender, who
notes that errors made by patients with organic brain involvement on the Bender- 
Gestalt Test appear to be of a definite and constant type and differ from the errors 
made by the ‘functional’ disease group as well as by those of normal children. 
Bender calls this a pathological performance and feels that children revealing cortical 
damage are unable to perceive and reproduce the forms due to an inherent disability. 
She concludes that the impulse, perceptual and confusional disturbances are symptoms
of organic brain damage. Thus it would seem that with and even without
specific diagnostic tests it may be possible to detect anomalies of function in “organic”
cases by a more critical examination of the specific abilities in which they
reveal disturbances. For our purposes, we will consider exclusively the memory and
visual motor function.


It may be well to review here the clinical picture of children with chronic epidemic
encephalitis as described by Wimmer. In his opinion, the psychic disturbances
of these children set in late and slowly and are punctuated by restlessness,
irritability, temper tantrums, mischievousness, destructiveness, abusive language, 
violence, arson, cruelty to children and animals, truancy, vagrancy, dishonesty and 
precocious erotism with obscene language and conduct. The children seem to be
transformed into “little devils”. Punishment has little effect. Inverted sleep rhythm
is very frequent and the nocturnal wakefulness is accompanied by great psycho- 
motor excitement. Intellectually there arises a secondary intellectual deficiency 
from lack of concentration and interest and fatiguability with consequent unreliable 
perception and imperfect retention so that in school, these children appear backward.
Even without confirmed etiology of the case under discussion, these symptoms
are of note.







THE PROBLEM 


The present paper concerns the case of a boy, aged 13, sent by the court to the 
Youth Service Board on charges of Breaking, Entering and Larceny, and Willfully 
and Maliciously Setting Fire to a Building, later diagnosed as a mental defective (W.
I. S. C. Full Scale IQ 48), and sent to the Lyman School for Boys for a period of
study and training. Etiology, either of the mental defect or the behavioral disturb- 
ance, was not further identified. 

Within two weeks of his transfer to our care, we became interested in his case 
specifically because he did not function like a mentally defective child. He was a 
friendly, approachable boy who was alternately alert and drowsy, at times somewhat 
vague and listless and again hyperactive and fidgety. His eyes held a “glassy” stare 
and occasionally he smiled inappropriately. Inherent good endowment was sus- 
pected from his coherent and sensible stream of speech. In addition, the previously 
administered Wechsler Intelligence Scale for Children (Full Scale IQ 48, Verbal IQ
66, Performance IQ —) yielded findings obviously meriting further analysis. When 
the actual test sheet arrived, it was noted that scores on the Sub-Tests of the per- 
formance scale were as follows: 


Picture Completion 
Picture Arrangement 
Block Design 

Block Assembly 
Coding 


On the verbal scale the most diagnostic finding was that of rote memory where digit 
span was 5 forwards and 3 backwards. These sub-test findings are highly significant 
of the performance of organic cases where there is a disturbance of the visual motor
sphere, memory defect, and a lowered capacity for organization and synthesis.


CASE HISTORY


History revealed normal birth with retarded development, difficulty with 
teething, and slowness in walking and talking as well as a severe case of measles in 
infancy which was almost fatal. The home was adequate. Both parents were of 
excellent repute, temperate and of good moral caliber. History of the 4 other sib- 
lings was entirely lacking with respect to behavioral difficulties. Some rivalry existed 
between this boy and siblings. In school, he was considered “sneaky and smooth”,
a poor scholar, 4 years retarded but capable of doing better work. He abused other 
children, was a persistent truant and was openly defiant of every school rule. “He 
showed no respect for persons or property”, and frequently made the statement to 
his teachers, “Only suckers go to school.” His pastor, who knew him well, said of
him, “He is a sneak thief of the first water. He has the smoothest face I have ever
seen including 7 years at the Women’s Reformatory. In my opinion, he is a confirmed
criminal.” Other than the court record listed above, there were 7 unofficial appearances,
offenses ranging from Breaking Windows and Attempt to Break and Enter to
Larceny and Breaking and Entering. In the home, mother reported poor sleeping habits
with nightmares in which he would scream and physically battle with his brother. 
He craved excitement, was easily aroused by other boys and often succumbed to 
their influence. He was frequently found late at night in alleyways hiding out. 


COURSE AND PROGRESS OF THERAPY


The following are excerpts of actual records, tests, and examinations over an 8 
month period of study. (Therapeutic interpretation has been purposely omitted 
since it does not add to the basic problem here involved.) 
DATE ENTRIES


5-29-52 Psychotherapeutic interview. “Since his transfer to the Lyman School, he has shown in- 
stances of somnambulism. It is reported that he is inclined to scream in his sleep, walk 
around, climb on windows and beams of the dormitory, and occasionally crawl on the
floor. He has no recollection of these activities the following morning. In interview, he 
admits some sibling rivalry and an inability to adjust to his contemporaries. ‘I get mad 
when other kids boss me around. I start screaming, swearing, or fighting.’ Relative to 
his fire history, he states that he walked into the house, found it unoccupied and set 
several papers on fire. After the fire had been set, he had the feeling of being glad that 
he had done it. He admits previous fire-setting episodes chiefly grass fires. His thinking 
is abnormal and bizarre. ‘I think queer things lots. I once had a dream that someone 
stabbed me. When I woke up, my brother was beside me. I thought he did it, so with 
my hand I tried to stab him.’ He frequently dreams of hanging from cliffs and falling. 
Following such dreams, he awakens with a headache. During the day he is frequently 
drowsy, and dozes at odd intervals. 


Impression: This youngster with history of retarded development, difficulty with teething
and slowness in walking and talking as well as a severe case of measles resulting
almost in death, is presently revealing symptoms of organic pathology, possibly the 
result of an early encephalitis. His current behavior is pathological and his mental con- 
tent bizarre. On a Bender-Gestalt examination administered today, there are evidences 
of cortical damage, a lack of conception of spatial relationships and at times a complete 
distortion of the gestalten. (See Figure 1, form 3.) 





Fig. 1. Bender-Gestalt Test Results on 5/29/52.


The IQ of 48 does not appear to be representative as he is alert and has insight. Very 
probably, this low score is the result of impairment in specific perceptual areas of psychic 
function. We are referring his case to our consulting psychiatrist for complete neurolog- 
ical work-up.” 


Cottage report. There are chronic night disturbances in which he will sit up in bed and 
scream, is prone to temper outbursts, strikes out at other boys without provocation. 


Psychiatric examination. “Neurological examination is negative. This is a youngster 
reportedly suffering with nightmares, the content of which he has no memory of in the 
morning. His emotional responses are very inappropriate. I asked him why he came and he
says, ‘I lit a fire’ and grinned. An empty house was set on fire. There is a history of 
retarded development in growth from birth . . . In addition to that, this youngster has 
had a severe case of measles which may have resulted in encephalitis and this may be 
contributing to his present conduct disturbance. There are neurotic traits and question- 
able sibling rivalry. A recent IQ rated him 48. I would like to see him re-tested. I do 
not believe him to be psychotic or committable at this time and recommend that further 
neurological investigation be accomplished, EEG, spinal fluid studies, and skull plates. 
I would recommend that Benzedrine, .005 gm. be given at bedtime.” Note: (On this
date it was agreed that he could best be treated symptomatically as well as therapeutically.)

6-5-52 Psychometric examination. Stanford-Binet, Form M, C. A. 13-4, M. A. 10-8, IQ 81.
Scatter is from the 8 to the 14-year level. Deficits in mental functioning are most prominent
on tests measuring visual imagery and memory. On the other hand, some verbal
functions are of a high level, namely sentence structure, vocabulary, and ideational
reasoning. Specifically, organic impairment is noted in faulty auditory and visual memory,
inability to reproduce designs (See Figure 2, Designs IX and XII), and low score on
digits backwards (3).

Summary: Despite the present rating, we feel that this is a boy of average intelligence 
whose present mentality is impaired due to certain marked deficits in the visual motor 
realm of mental functioning. 
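
The arithmetic behind the reported quotient may be worth spelling out, since a simple ratio of mental age (10-8, or 128 months) to chronological age (13-4, or 160 months) gives 80 rather than 81. The sketch below reproduces the reported figure under an assumption that is ours and is not stated in the record: that the examiner followed the convention of the 1937 Stanford-Binet revision, in which only two of every three months of chronological age beyond 13-0 are counted, up to age 16. The function names are hypothetical.

```python
def to_months(years, months):
    """Convert an age written as years-months (e.g. 13-4) to months."""
    return 12 * years + months


def adjusted_ca_months(ca_months):
    """Chronological-age divisor under the ASSUMED 1937-revision convention:
    beyond 13-0, count only two of every three further months, up to 16-0
    (which yields a maximum divisor of 15-0)."""
    threshold = to_months(13, 0)
    ceiling = to_months(16, 0)
    if ca_months <= threshold:
        return float(ca_months)
    extra = min(ca_months, ceiling) - threshold
    return threshold + extra * 2.0 / 3.0


def ratio_iq(ma_months, ca_months):
    """Ratio IQ: 100 x mental age / adjusted chronological age, rounded."""
    return round(100.0 * ma_months / adjusted_ca_months(ca_months))


# Figures from the examination of 6-5-52: C. A. 13-4, M. A. 10-8.
iq = ratio_iq(to_months(10, 8), to_months(13, 4))
print(iq)
```

The adjusted divisor shrinks the denominator only slightly at this age, so the correction moves the quotient by less than a point before rounding; it does not bear on the much larger discrepancy between the Stanford-Binet result and the earlier W. I. S. C. rating of 48.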

Fig. 2. Stanford-Binet Items on 6/5/52.


Electroencephalographic study. ‘Localization studies revealed poor synchronization 
from the occipital areas but no discrete focus is elicited.” Results considered not re- 
liable because of hyperactivity of subject. 


Psychotherapeutic interview. “Subject describes episodes as follows: ‘A few days ago, I 
was talking to a boy when I suddenly got dizzy. The boy was still there, but I don’t 
remember what happened next. He says my eyes were staring and out of focus’. Before 
falling asleep at night, he can hear the sound of a train. The train appears to be coming 
directly at him. In panic, he jumps out of bed. He has no recollection of somnambulistic 
activities as described by cottage personnel.” 


Psychiatric examination. “He alleges that he is suffering with spells which he describes 
as a strange feeling. He would be looking at something, get a strange feeling, then it
would stop and he would get dizzy. There is a marked discrepancy in results of the
various intelligence tests. He was rated with an IQ of 48 and just recently received an
IQ of 81. There is one consistent feature, however, and that is difficulty in performance
and in the visual motor function. May I suggest that he be tested again and that he 
have the benefit of another EEG. I would continue to treat him symptomatically.” 


Repeat EEG study. “Conclusion: Essentially negative tracing at rest, showing moderate
amount of cortical instability under stress. This EEG differs from previous recording 
of 6-10, in that tracing is more unstable during hyperventilation studies.” 


Psychotherapeutic interview. “Subject reports additional dizzy spells. ‘I think many 
queer thoughts. The other day, while I was sitting in the shade by the water tower, I 
had the feeling that the tower was going to fall and crush me. The thought stayed with 
me a long time, just like it was glued right to my head.’” Note: (Consulting psychiatrist
and school physician conferred this date regarding medication. Dilantin, gr. 4 t.i.d. 
to be initiated in place of Benzedrine). 

8-15-52 Psychotherapeutic interview. “For the past 3 weeks there have been no dizzy spells and
fewer night disturbances. Sleep is more restful. There is less fatigue during the day.
The bizarre and disturbing thoughts have abated to an appreciable degree. There is
evidence to believe that medication has aided in allaying the hyperkinesia and disturbed
thought content. Social adjustment is somewhat better. Affect is warm.”


8-15-52 Psychological re-examination. Stanford-Binet, Form M. C. A. 13-6, M. A. 12-8, IQ 95. 
Fig. 3. Stanford-Binet Items on 8/15/52.


(In examiner’s opinion, practice effect is almost negligible in that comparison of items 
on previous examination with current analysis of individual tests indicates little or no 
consistency of performance.) “Today's examination, which again revealed many
inconsistencies of mental function and some perseveration, reveals nonetheless a mental
age higher by 2 years than the previous examination findings. Logical reasoning and 
rote memory, vocabulary, and verbal abstractions were on a high sphere. Digit span 
was 8 forward and 5 backward. Relatively simple tests such as Contig and Copying 
a Bead Chain were failed. Scatter was from the 9 through the 14-year level. All tests 
were failed at the average adult level but a test of rote memory was passed at the Super- 
ior Adult I level. It is indeed of interest to note wherein the discrepancies lie between 
this and former examination. Perhaps the most noteworthy findings are his exceptional 
handling today of rote memory items as well as tests measuring visual imagery. (Figure 
3). These functions were markedly impaired on previous testing. There is little doubt 
but what this youngster is still operating under cortical stress. Inconsistencies persist 
although performance is dramatically improved, specifically on visual motor and mem- 
ory function. We are in hopes that under medication organic symptoms will reveal 
abatement. These test results are of extreme interest, however, in that they indicate 
that this boy who came to us with an IQ of 48 and was considered a mental defective 
is actually of average intelligence. Granted there are impairments in certain spheres of 
mental functioning.” 

Note: (At this time we questioned the etiology back of the improved performance in the 
visual motor and memory function, asking whether medication was the sole factor 
responsible for dissipation of the previously noted gross anomalies or whether other 
factors including the therapeutic process were also operating toward such purpose.) 


Spinal Fluid studies. Pressure and spinal fluid negative. 
Psychotherapeutic interview. “There are still sleep disturbances but no additional dizzy
spells. He says that he has an uncontrollable temper on occasion, is prone to be
argumentative and engages in many fights.”
1-8-53


1-8-53 
Cottage report. Cottage personnel note that when he strikes out at other boys, he does
not appear to know what he is doing. 


Medication changed to Benzedrine—.005 gm. at bedtime. 
Psychiatric examination. “This youngster with history of encephalopathy continues to 
manifest behavior difficulties and some maladjustment. He was recently demerited for
‘blowing a fuse during an attempt to light a cigarette’. Treatment should continue to
be symptomatic as well as therapeutic.”


Psychotherapeutic interview. “He is a likeable, oping boy who is easy to reach
therapeutically. He reports dreams pertaining to fighting. During these fights, he attacks
inanimate objects such as rocks and railroad tracks. He has many fears, the most 
prominent of which is fear of fire. He relates this fear to his early childhood when he 
received severe burns from stoves. In bed, he frequently thinks of fire and is fearful of 
what would happen should the building burn down. Memory of his present night dis- 
turbances is more acute than formerly.” 

Note: (To be referred again to the consulting psychiatrist re. medication.) 


Psychiatric examination. “Treatment of choice is continued training. However week- 
ends under better than average supervision may have some therapeutic value. I would 
discontinue benzedrine in order to note sleep cycle without medication.” 


Released for 10 days Christmas leave to parents and returned without untoward diffi- 
culty in community. 


Psychometric Study. Bender-Gestalt test repeated in order to note deterioration or 
improvement from examination of 5-29. Most noteworthy findings on this test were improved
performance on Form #3 (see Figure 4) and better handling of diamond figure. There
are still some impairments within the visual motor sphere and mild distortion of spatial 
relationships, but on the whole, the performance is significantly improved. 


Figure 4. Bender Gestalt Test Results on 1-8-53.


Psychotherapeutic interview. “Sleep disturbances have abated to a perceptible degree.
There is still blurring of vision when looking at something.

Opinion: There is an improvement in the case of this youngster with history of
encephalopathy. Most noteworthy is the marked progress in mental function, specifically
in those areas where impairment was [illegible] prominent. However, some
evidence of cortical damage persists. If and when he is released to the community, he
should be required to have a repeat EEG accomplished in 6 months. He is also in need
of more adequate supervision than his parents are able to give him and the case
worker should be warned of the potentialities in his case for further asocial activities.”

Psychiatric examination. “This youngster has been previously seen, most recently in
November. Since that time, he has had the privilege of spending 10 days at home during
the Christmas holiday and returned without difficulty. Adjustment in the school is
reported to be improved. His sleep cycle is also bettered. I would have no objection to
indefinite visit under better than average supervision and further require that he attend
the Child Guidance Clinic in his home town.”


Released to parents. 


per te Since above date, he has exhibited no behavior difficulties in the community 
(a period of 6 months). Long-term prognosis, however, is guarded. 
SUMMARY AND CONCLUSIONS

The case of a 13-year old juvenile offender with a diagnosis of encephalopathy 
has been discussed. The study is significant for two reasons. First, it appears possible 
to detect deficits in mental function from a test of general intelligence as well as from 
a diagnostic one. Both types of test in this study indicated that cortical damage 
existed which would make this boy appear mentally retarded despite his adequate 
native endowment. Second, the clinical findings emphasize the over-all problem 
which the severe behavior disturbance accompanying this syndrome presents to the 
community. If it could be possible to establish long-term treatment clinics in individ- 
ual communities for postencephalitic children, such cases would not appear in 
training schools which are specifically equipped to train only those behavior prob- 
lems whose asocial patterns stem from a situational source. 
AN EXPERIMENTAL INVESTIGATION OF SOME SOURCES OF
VARIANCE IN THE WHOLE RESPONSE TO THE RORSCHACH
INK BLOTS¹


EDWARD J. KEYES 
Rohrer, Hibler and Replogle 
New York, N.Y. 


INTRODUCTION 


Psychological interpretation of Rorschach performance has been largely de- 
rived from an a priori foundation and has continued in usage on that basis rather 
than as a result of experimental verification. Attempts to demonstrate the validity 
of many of these interpretations have led to negative or ambiguous results“). Never- 
theless, it has been shown that not only is there measurable variability in perform- 
ance, but also that there is consistency within the individual in responding to the ink 
blots“; !®, The assumptions which have been made about the psychological mean- 
ing of response categories may or may not stand up under investigation. This does 
not necessarily invalidate the Rorschach method, but may require revision of its 
application. If the sources of variance can be identified, interpretation of Rorschach 
records may be based on experimental evidence and thus be more meaningful. 


PROBLEM 


The purpose of this study was to examine the whole response to the Rorschach 
Ink Blots to identify some of the principal sources of variance. The specific
hypotheses tested in this investigation were as follows:


1. W is a function of perceptual organization, as defined by performance on the Street Gestalt
test.


¹Adapted from a dissertation submitted in partial fulfillment of the requirements for the degree
of Doctor of Philosophy, in the Department of Psychology, in the Graduate College of the State
University of Iowa. The writer wishes to express his appreciation to Dr. A. L. Benton for his supervision
of this research, to Dr. W. W. Morris for many helpful suggestions and to Dr. H. P. Bechtoldt for
advice about the experimental design. He also wishes to acknowledge the cooperation of the
Psychology Departments of Grinnell College and Carroll College in supplying subjects for the experiment.
2. Perceptual organization is a response tendency that is subject to learning and can be
increased through training effected by practice in identifying mutilated pictures and measured
by subsequent performance on the Street Gestalt test.

3. Individuals trained in perceptual organization (identifying mutilated pictures) will respond
with a greater number of W than those not so trained.

4. Instructions for subjects to respond only to the whole of the ink blot will lead to a change in
the set or tendency and to a larger number of W than for subjects not so trained.

5. The number of W+, i.e., whole responses judged to be of good form, will also increase with
training and with instructions to respond only to the whole blot.


PROCEDURE 


Subjects. Eighty-eight men and women undergraduate students enrolled in various 
psychology courses at the State University of Iowa, Grinnell College and Carroll 
College served as subjects in the experiment with an additional twenty subjects from 
Iowa serving as a special control group. A complete replication of the experiment 
was carried out at each of these institutions. 


Experimental Design. The experiment was designed as a complex factorial problem 
involving three factors, following the model presented by Edwards“. The factors 
varied were: 1) training in perceptual organization; 2) instructions to the subject for 
the Rorschach; 3) replication in different populations. The factors of Training and 
Instructions were each varied in two ways and there were three replications. 
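
The partition of degrees of freedom implied by this design can be sketched as follows. This is only an illustration in Python and not part of the original analysis; the level labels and the function name `degrees_of_freedom` are assumptions, and 88 is the number of experimental subjects given above (the twenty special controls are excluded).

```python
from itertools import combinations
from math import prod

# Factor levels as described in the text; the labels themselves are illustrative.
FACTORS = {
    "Training": ["trained", "not trained"],
    "Instructions": ["W only", "usual"],
    "Colleges": ["Iowa", "Grinnell", "Carroll"],
}
N_SUBJECTS = 88  # experimental subjects only


def degrees_of_freedom(factors, n_subjects):
    """Partition the degrees of freedom of a fully crossed between-subjects design."""
    names = list(factors)
    df = {}
    # Each non-empty subset of factors is a main effect or an interaction;
    # its df is the product of (levels - 1) over the factors involved.
    for size in range(1, len(names) + 1):
        for effect in combinations(names, size):
            df[" x ".join(effect)] = prod(len(factors[name]) - 1 for name in effect)
    # Within-groups df is the number of subjects minus the number of cells.
    n_cells = prod(len(levels) for levels in factors.values())
    df["Within Groups"] = n_subjects - n_cells
    return df


DF = degrees_of_freedom(FACTORS, N_SUBJECTS)
for source, value in DF.items():
    print(f"{source:40s}{value:4d}")
```

Under these assumptions the effects carry 1 or 2 degrees of freedom each and the within-groups term carries 76, which sum to the total of 87 for 88 subjects.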

All experimental procedure was conducted in groups. Each group spent one 
session in the experimental situation. For each replication the sequence of events 
was as follows: 


Treatment Group I (TI) Training - W only instructions 
1. Training 30 minutes 
2. Recess 5 minutes 
3. Rorschach 50 minutes 
Treatment Group II (TNI) Training - Usual instructions 
1. Training 30 minutes 
2. Recess 5 minutes 
3. Rorschach 50 minutes 
Treatment Group III (NTI) No training - W only instructions 
1. Warm up task 30 minutes 
2. Recess 5 minutes 
3. Rorschach 50 minutes 
4. Street Gestalt test 5 minutes 
Treatment Group IV (NTNI) No training - Usual instructions 
1. Warm up task 30 minutes 
2. Recess 5 minutes 
3. Rorschach 50 minutes 
4. Street Gestalt test 5 minutes 


The Rorschach was administered in general accordance with the group pro- 
cedure outlined by Harrower and Steiner“). Each slide was projected for one and 
one-half minutes in the original position, one-half minute on the right side, one 
minute upside down and one-half minute on the left side. Treatment Groups I (TI)
and III (NTI) were instructed to respond only to the whole blot and Treatment 
Groups II (TNI) and IV (NTNI) were given instructions to respond to everything 
they saw. Subjects recorded their responses verbatim. After all the slides had been 
exposed, the blots were projected again and a location inquiry was made to determine 
whether the area responded to was the whole blot or a detail. This was done for the 
groups instructed to respond only to the whole area, as well as for the “usual
instruction” groups, because of the suspicion that even with a verbally induced set to
see only W, an occasional detail response might be made by some subjects. 

The stimulus materials used in training consisted of two series of mutilated 
pictures similar to the Street Gestalt pictures. These were prepared by the author 
and reproduced on 35 mm film, each picture occupying a double frame. There were 
twenty-five pictures in each series, the first picture in each series serving as an 
example. 
In training, each picture was exposed for approximately two seconds with
approximately ten seconds between pictures. Both series were administered in this
manner, the subjects recording their responses. Following this, each series was re- 
peated slowly so that each picture was exposed long enough for every subject to 
report that he perceived the picture correctly. 

As a last step in training, the Street Gestalt test was administered in the same 
way as the two Training Series, except that the test was not repeated. The Street 
Gestalt pictures were those used by Thurstone in “A Factorial Study of Percep- 
tion’ ®?), There were twenty-seven pictures, also reproduced on 35 mm film. This 
test was administered to the non-trained groups after they had taken the Rorschach, 
to provide a measure of performance for evaluation of the effect of training. 

To evaluate more precisely the effect of training, two groups of ten each served 
in a special condition. For these groups the procedure was to give a warm up task for 
a twenty minute period of time followed by administration of the Street Gestalt 
test. Since the Street Gestalt test was the last part of the training procedure, this 
served to put the Street Gestalt test at the same relative time for the experimental 
groups and the control groups. 


RESULTS 


Scoring. The first measure of the performance on the Rorschach was W, the total 
number of whole responses, undifferentiated. The second score was W+, the num- 
ber of responses judged to be of good form. 

The Street Gestalt test was scored for accuracy according to the actual identity 
of the stimulus. The symbol SG was used to denote the number of responses made, 
and SG+ to indicate the number of correct responses. 


Analysis of Rorschach Performance. The summary of the complex analysis of variance 
for the number of W and the number of W+ is contained in Table 1. Because the
distributions of the scores were not markedly skewed and because the means and 
standard deviations were independent, it was not necessary to transform the data, 
so that the raw scores were used in the analysis. The criteria values of F to be ap- 
plied to Table 1 are as follows: 

F .01 (df = 1,76) 6.98 

F .05 (df = 1,76) 3.97 

F .01 (df = 2,76) 4.90 

F .05 (df = 2,76) 3.12 


TABLE 1. Summary Table for the Complex Analyses of Variance for W and W+

                                       W                       W+
Source of variation     df       SS         MS          SS         MS         F

Training                 1    1099.103   1099.103     520.410    520.410   10.374**
Instructions             1     525.284    525.284     142.546    142.546    2.842
Colleges                 2     716.414    358.207     257.183    128.592    2.563
T x I Interaction        1     405.919    405.919     305.694    305.694    6.094*
T x C Interaction        2     173.202     86.601     115.840     57.920
I x C Interaction        2      95.216     47.608      54.150     27.075
T x I x C Interaction    2     266.856    133.426     160.678     80.389
Within Groups           76    6184.722     81.368    3812.363     50.163

** Significant beyond the one per cent confidence level
 * Significant between the five and one per cent confidence levels
Examination of the data in Table 1 indicates that, for W, the main effect of Training
is significant at the one per cent level of confidence, that the main effect of
Instructions is significant at the five per cent level of confidence, that there is a difference
between Colleges which is significant at the five per cent level of confidence and that
the Interaction between Training and Instructions is significant at the five per cent
level of confidence.
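
These conclusions can be checked by dividing each mean square by the Within Groups mean square and comparing the quotient with the criterion values of F listed above. The following is a minimal sketch in Python, assuming the mean squares as printed in Table 1 and numerator degrees of freedom of 1 for Training, Instructions and their interaction and 2 for Colleges; the names `MEAN_SQUARES`, `CRITICAL_F` and `f_ratios` are illustrative only.

```python
# Mean squares as printed in Table 1:
# source -> (numerator df, MS for W, MS for W+).
MEAN_SQUARES = {
    "Training": (1, 1099.103, 520.410),
    "Instructions": (1, 525.284, 142.546),
    "Colleges": (2, 358.207, 128.592),
    "T x I Interaction": (1, 405.919, 305.694),
}
# Within Groups mean squares (76 df) for each measure.
MS_WITHIN = {"W": 81.368, "W+": 50.163}
# Criterion values of F from the text: numerator df -> (.05 level, .01 level).
CRITICAL_F = {1: (3.97, 6.98), 2: (3.12, 4.90)}


def f_ratios():
    """Return {(source, measure): (F, level)} for the effects listed above."""
    results = {}
    for source, (df_num, ms_w, ms_w_plus) in MEAN_SQUARES.items():
        f05, f01 = CRITICAL_F[df_num]
        for measure, ms in (("W", ms_w), ("W+", ms_w_plus)):
            f = ms / MS_WITHIN[measure]
            if f >= f01:
                level = ".01"
            elif f >= f05:
                level = ".05"
            else:
                level = "n.s."
            results[(source, measure)] = (round(f, 3), level)
    return results


RESULTS = f_ratios()
for (source, measure), (f, level) in RESULTS.items():
    print(f"{source:20s}{measure:4s}F = {f:7.3f}   {level}")
```

On these figures the recomputed ratios for W+ reproduce the F column of Table 1, and the ratios for W fall at the significance levels described in the text.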


The significant difference between Colleges for the mean W score requires close 
examination. None of the Interaction terms involving Colleges are significant, nor 
does Colleges appear as a significant factor in the analysis of W+. Examination of 
the means and standard deviations provided in Table 1 suggests that the difference 
is one of levels. This may be due to chance, or to differences in the experimental 
conditions, or to systematic differences in the populations. The interpretation 
accepted was that this difference most probably was due to chance and no further 
analysis of the variation due to replication was made. 

From Table 1 it can also be seen that the effect of Training on W+ is significant
beyond the one per cent level and that the effect of the Training x Instruction inter- 
action is significant between the five and one per cent confidence levels. In this 
analysis, neither the effect of Instructions nor that of Colleges was significantly differ- 
ent from chance expectation. 

Evaluation of the differences between the means of the pairs of Treatment 
Groups clarifies the information about the principal sources of variance in both W 
and W+. Table 2 shows the values of t calculated for the differences between pairs of
Treatment Groups. 


TABLE 2. Values of t Between Pairs of Treatment Groups for W and W+

                     W                         W+
Groups       Mean Diff.       t        df       t

I-II            9.132      2.980**     42     2.958**
I-III          11.364      4.078**     42     4.007**
I-IV           11.955      4.164**     42     3.419**
II-III          2.182       .778       42     1.054
II-IV           2.773       .961       42     1.136
III-IV           .591       .230       42     1.182

** Significant beyond the one per cent confidence level.


With df = 42 for all pairs, the criterion for significance at the five per cent level
of confidence is a value of t = 2.017 and that for the one per cent level is a t of 2.696.
It is seen from Table 2 that for both W and W+ the performance of Treatment
Group I (TI) is significantly different from all other groups beyond the one per cent 
level of confidence and that all other differences are not significant. This group re- 
ceived both Training and Instructions to respond only to the whole area. It seems 
apparent that the important variable in producing differences in the number of whole 
responses is the interaction between Training and Instructions. 


Learning. The training effect was measured by testing for the significance of differ- 
ences between the means in performance on the Street Gestalt test between those 
who received training and those who did not. Table 3 contains a summary of the 
statistics employed in evaluating the effectiveness of training. 
TABLE 3. Means, Standard Deviations, Mean Differences and Values
of t Calculated in Evaluating the Effect of Training

Groups Compared        n     Mean     S.D.   Mean Diff.      t        df

SG (Trained)          44    20.409
SG (Not Trained)      44    17.091              3.318     5.484**     86

SG+ (Trained)         44    17.011
SG+ (Not Trained)     44    12.875              5.136     4.918**     86

SG (Trained)          44    20.409
SG (Control)          20    12.900              4.509     5.803**     62

SG+ (Trained)         44    17.011
SG+ (Control)         20    10.975                        7.347**     62

** Significant beyond the one per cent confidence level.


In the experimental groups, for SG, a value of t = 5.484 was obtained, which for
df = 86 is significant beyond the one per cent level. The difference between the
means of SG+ for the same groups produced a t of 4.918, which is also significant
beyond the one per cent level of confidence. 

A special control group was used to check the possibility that a fatigue factor 
or a difference due to the Rorschach influenced the Street Gestalt performance of 
the untrained groups. This group received the Street Gestalt test at the same 
relative time as the groups which were trained. The performance of the control 
group was then compared with the performance of the trained groups. Applied to 
SG, a t of 5.803 was obtained, which for df = 62 is significant beyond the one per cent
level of confidence. The mean difference for SG+ between the groups trained and
the control group led to a value of t = 7.347, also significant beyond the one per cent
level. 
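
The degrees of freedom quoted for these comparisons follow from the group sizes, df = n1 + n2 - 2 for two independent groups. A minimal sketch in Python, using the group sizes and means listed in Table 3, is given below. The names `COMPARISONS` and `summarize` are illustrative, and the differences it prints are recomputed from the means rather than taken from the author's table, so they need not agree with every printed figure.

```python
# (label, n1, mean1, n2, mean2), with group sizes and means as listed in Table 3.
COMPARISONS = [
    ("SG, trained vs. not trained", 44, 20.409, 44, 17.091),
    ("SG+, trained vs. not trained", 44, 17.011, 44, 12.875),
    ("SG, trained vs. control", 44, 20.409, 20, 12.900),
    ("SG+, trained vs. control", 44, 17.011, 20, 10.975),
]


def summarize(comparisons):
    """Return (label, mean difference, df) for each two-group comparison."""
    rows = []
    for label, n1, mean1, n2, mean2 in comparisons:
        mean_diff = round(mean1 - mean2, 3)
        df = n1 + n2 - 2  # df for a t test on two independent groups
        rows.append((label, mean_diff, df))
    return rows


SUMMARY = summarize(COMPARISONS)
for label, mean_diff, df in SUMMARY:
    print(f"{label:32s}diff = {mean_diff:6.3f}   df = {df}")
```

The sketch gives df = 86 for the comparisons between trained and untrained groups and df = 62 for those involving the special control group, as stated in the text.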


SUMMARY AND CONCLUSIONS 


The use of the Rorschach Ink Blots in this experimental procedure is by no 
means to be considered as equivalent to the usual clinical use of the Rorschach. 
Generalizing from the findings of this experiment to psychological interpretation of 
clinical Rorschach protocols must be cautiously approached. Individually admin- 
istered, the Rorschach is subject to many more variable factors than when admin- 
istered to groups. Only one response category was studied in this experiment and 
only one determinant, that of form. Since clinical interpretation is made in the light 
of all characteristics of a response as well as its relationship to other responses, it is 
unreasonable to state categorically that these findings may be directly applied to the 
Rorschach as a diagnostic instrument. 

However, the use of the Rorschach Ink Blots as stimuli in this kind of
experimental approach has provided valuable information about the functional meaning
of the whole response. As a result of the evidence presented in the previous sections, 
it may be concluded that W reflects both training in perceptual organization and an 
additional tendency or set to organize. These two factors interact, when present 
together, to produce an increase in the number of W and W+. 

A possible clinical implication of the evidence from this experiment is in evalua- 
tion of diagnostic categories in which W is a differentiating factor. An experiment 
designed to evaluate differences in responsiveness to training in perceptual organiza- 
tion and instructions might be useful in studying these conditions. 

The kind of experimental procedure followed in this investigation may be ap- 
plicable in studying other Rorschach response categories. The relationship of other 
perceptual factors to Rorschach performance offers a wide area of investigation. 
Ultimately, perhaps, this might lead to the clinical usage of a variety of simpler tests 
to explore more fully some of the diagnostic information obtained by the Rorschach. 
THE EFFECT OF HYPNOTICALLY INDUCED HOSTILITY ON
RORSCHACH RESPONSES¹


FRANK A. PATTIE 
University of Kentucky 


INTRODUCTION 


In a recent study Counts and Mensh®) used the Rorschach test to register 
changes produced in subjects (Ss) in whom hostility toward the experimenter (E)
had been induced by posthypnotic suggestion. They inquired whether the test 
would reveal the same basic personality structure when administered under normal 
conditions, after induction of hostility, and on two subsequent occasions after re- 
moval of hostility. They were also interested in the white-space response (S) as an 
indicator of hostility. The data from five Ss under the four conditions were combined 
and subjected to a series of analyses of variance for several variables. The changes 
found were not statistically significant, but certain trends were noted. They make 
no mention of changes in the content of the responses nor the degree of similarity 
existing among the four protocols obtained from each S. 

Previous workers in this field were Sarbin “, who told a S that she was Madame 
Curie and then obtained from her numerous Rorschach responses relating to science, 
and Levine and others“), who studied the effect of various attitudes (not including 
hostility) in a female graduate student. 

The present study was begun with no particular hypothesis or prediction of re- 
sults in mind except that hostility or anger would lead Ss to perceive more objects 
of hostile import than they would under normal conditions. 


PROCEDURE 


The Ss were 14 undergraduate students, in all of whom post-hypnotic amnesia 
and responses to post-hypnotic suggestion had been observed. They are designated 
by numbers, and their sex is indicated by M and F. All but two Ss were students in 
an elementary course in psychology, in which hypnosis was not discussed. 

The procedure included two parts: 

(a) S was hypnotized and told that after coming out of the trance he would 
be given a test, which he would take “in a calm, normal frame of mind,” and that 
after finishing the test he would completely forget his responses to it. 

(b) S was hypnotized and told that after coming out of the trance he would 
feel “very hostile, aggressive, and angry” towards E, that E would then give him
the ink-blot test, and that these feelings would continue until E said, “This is the
end of the test.” After completion of the test S was asked non-leading questions
about his feelings while taking the test. 

It was first planned to give (a) before (b) in half of the Ss and to reverse this 
order for the rest. However, after three varieties of response had been observed in 
the first seven Ss, this alternation was discontinued. At this point in the experiment 
(b) had preceded (a) for Ss 2, 3, and 12. It was then decided to use seven additional 
Ss (5, 6, 7, 8, 10, 13, 14) and to ask each S after completion of (a) this question: 
“If a person were to take this test while he was feeling very angry, hostile, and ag- 
gressive toward the giver of the test, what effect would these feelings have on his 
responses to the test?” 

The Rorschachs were scored according to Beck’s principles“. Responses of 
the following classes were considered indicators of hostility: 
1. Weapons, organs, and implements of aggression: bombs, guns, clubs, knives, claws,

2. Violent forces: explosions, volcanoes, fire if involving danger or threat. 


¹This paper was read in part at the Southern Soc. for Philosophy and Psychology in April 1952.
The writer is indebted to Dr. Murray Lonstein for help in scoring the Rorschachs.







3. Aggressive human beings or animals: demons, sadists, wild boars, lions, snakes, pre- 
datory birds. 

4. Blood. 

5. Activities of a hostile kind: wrestling, fighting, biting. 

6. Human beings and animals which had suffered some kind of aggression resulting in 
fright, mutilation, or death. 

7. Hostile expressions on the faces of human beings or animals. 

8. Confabulatory responses such as “a large rock about to fall and hit someone,” “an eagle 
carrying another bird in his claws” (but in the inquiry S says he does not see the other bird). 


If a response fell under more than one of the above categories (for example, a de- 
capitated body with blood on it), it was counted as only one hostile response. 

The degree of similarity between two Rorschach protocols (called communality 
in this paper) was found by taking the number of responses which were common to 
the two and dividing it by the smaller R. Thus, if 40 responses are given on one 
test and 50 on the other, 10 being common to both, the communality is 25%. 
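
The communality index defined above can be written out as a short computation. This is a sketch in Python under the definition just given (common responses divided by the smaller R, expressed as a percentage); the function name `communality` and its argument names are illustrative and do not come from the paper.

```python
def communality(common, r_first, r_second):
    """Percentage of shared responses relative to the smaller of two protocols.

    common   -- number of responses appearing in both protocols
    r_first  -- total number of responses (R) on the first test
    r_second -- total number of responses (R) on the second test
    """
    smaller_r = min(r_first, r_second)
    if smaller_r <= 0:
        raise ValueError("each protocol must contain at least one response")
    if not 0 <= common <= smaller_r:
        raise ValueError("common responses cannot exceed the smaller R")
    return 100.0 * common / smaller_r


# Worked example from the text: 40 and 50 responses, 10 common to both.
print(communality(10, 40, 50))  # 25.0
```

Dividing by the smaller R means that the index can reach 100 per cent when every response of the shorter protocol reappears in the longer one.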


RESULTS AND DISCUSSION 


1. All Ss reported at the end of the test (b) that they had had feelings of anger 
or hostility toward E. The reason for the anger given by all Ss but one was that E
was wasting S’s time by giving a senseless or useless test; the wording differed among 
Ss, but there was a remarkable agreement among them. Subject 2, however, inter- 
preted her feelings as depression, saying, ‘‘I must have been looking on the dark side 
of things, seeing everything evil and unpleasant in the blots.” All Ss stated after 
completing the second test that they did not remember having taken it before. 

2. The Ss can be classified in three groups according to the changes produced: 
Group I, eight Ss who showed more than a twofold increase in percentage of hostile 
content; Group II, two Ss whose behavior was very uncooperative and involved re- 
jection of some cards and criticism of all of them, but did not involve any increase in 
hostile content; Group III, four Ss showing neither of these changes. Of the first 
seven Ss four were in Group I, one in II, and two in III; of the second seven (ques- 
tioned as to effects of anger) the distribution was exactly the same. Some of the 
data are given in Table 1. 


TABLE 1. Some Rorschach Variables for Conditions (a) and (b)
3. The opinions of the Ss questioned about the effects of anger fell into three
classes: (a) that there would be an increase in hostile content (5, 6, 7, 8), (b) that 
Ss would not cooperate (10), (c) that there would be no effect (14) or no effect but 
hurrying (13). These answers matched their subsequent Rorschach behavior in 
every particular save one. Ss 6 and 13 stated that S would hurry through the test. 
However, S 6 did not hurry, taking more time in (b) than in (a). S 13 decreased her 
time by 39% in (b). 

4. The number of responses decreased in hostility. The mean change is a de- 
crease of 8.9 responses. The probability of a difference of this size arising by chance 
is about .01. Eleven Ss decreased, three increased their productivity. The three 
Ss (2, 3, 12) for whom the order was (b) - (a) gave fewer responses in (b), their dif- 
ferences (9, 15, and 11 responses) being in every case larger than the mean difference. 

5. The results do not support the theory that the white-space responses in- 
dicate hostility or aggression, as stated by Rorschach“), who used the term “‘op- 
positional tendencies,” and more recently by Bohm: ?- 5), who states as a funda- 
mental rule that these responses plus color responses are found in manifest, extern- 
ally directed aggression (but a negative finding in this respect is not always a proof 
of a contrary condition), while 8 responses plus poor movement (M-—) responses 
occur in introjected or inhibited aggression. In relatively stable, intelligent Ss the 
former kind of aggression expresses itself mostly as criticism of the stimuli (Fremd- 
kritik), the latter as self-criticism (Selbstkritik). Bohm speaks®: »- '*8) of the S 
responses as representing “‘der Geist, der stets verneint,’ the negativistic principle
of destruction, aggression.” Some skepticism as to this interpretation has arisen;
Scharmann “) gives evidence tending to show that the S response indicates intel- 
lectual flexibility. 

In this experiment, five Ss increased the number of S responses in (b), five de- 
creased them, and four gave the same number in both conditions. If we use per- 
centages rather than absolute numbers, we find an increase of S% in (b) in seven Ss, 
a decrease in five, and no change in two. 

6. The results likewise support no theory as to an increase in color responses 
in hostility. These responses decreased in (b). The percentages occurring in the 14 
Ss are: (a) C 0.6, CF 3.5, FC 7.0; (b) C 0.4, CF 3.0, FC 5.2. Corresponding figures 
for Group I are: (a) C 1.0, CF 4.6, FC 8.3; (b) C 0.3, CF 4.3, FC 5.8. 

7. Since Counts and Mensh presented data on movement responses, they are 
given here also. Percentages were: (a) 4.7, (b) 3.4. For Group I the corresponding 
per cents are: (a) 11.7, (b) 12.3. 

8. The mean communality of the two tests was 34 per cent. This number may
be compared with the results, reported by Griffith, of retesting four patients 
suffering from Korsakoff’s syndrome. The present writer’s estimates of the com- 
munality in the patients’ protocols are 56, 59, 82, and 90 per cent. The communali- 
ties of only two of the 14 Ss exceeded the lowest communality of the Korsakoff cases, 
of whom Griffith says “there can be no doubt that memory as the recall or recogni- 
tion of percepts was not operating.” If we take his data as showing the effect of an 
obliteration of memory traces upon the communality of test and retest, then we may 
plausibly conclude, in view of the much lower mean communality shown by the 
normal Ss, that most of the latter Ss acted as if they interpreted E’s instructions to
forget their responses to mean that they were to suppress their previous
responses to the blots and not allow them to emerge on the retest.

9. The fact that the reactions of the second seven Ss were in harmony with 
their opinions as to what would happen in hostility does not prove that their be- 
havior was merely the cold-blooded acting of a role. Just as hostility is expressed 
in different cultures by means of different acts, gestures, and weapons, so it may be 
expressed by different Ss in various manners which are in agreement with their 
ideas as to the available modes of expression. 

10. Bohm’s statement, referred to above, about the occurrence of criticism of 
the blots (Fremdkritik) was borne out in the cases of Ss 9 and 10, both of whom were 
very critical (“There’s nothing there but a messy blot,” “This is an insult to my
intelligence”) and rejected some of the cards. Fremdkritik was not a feature of the
records of the other Ss. 


SUMMARY 


The Rorschach responses of 14 Ss were studied when the test was given (a) under 
normal conditions and (b) with posthypnotically induced hostility toward E. Ss
were classified in three groups according to their reactions in (b): those who showed 
more than a twofold increase in percentage of hostile content in their responses 
(eight Ss), those who were uncooperative and relatively unproductive (two), and 
those who showed neither of these changes (four). The last seven Ss used in the ex- 
periment were asked, after being given the test under normal conditions, what effect 
would be produced by hostility toward E, and their subsequent Rorschach responses
were in accord with their opinions. The results offer no support whatever to the 
theory that white-space responses and color responses are indicators of outwardly 
directed hostility. This finding is in accord with those of Counts and Mensh. How- 
ever, this experiment goes further than their work by reporting three different types 
of response under hostility and by reporting and discussing the degree of communal- 
ity of the Rorschachs obtained under the two conditions. 
FURTHER VALIDATION STUDIES OF THE WECHSLER MEMORY
SCALE¹


ALVIN R. HOWARD 
Veterans Administration Hospital, Chillicothe, Ohio²


INTRODUCTION 


Since the publication of an earlier article (3) concerning the diagnostic value of
the Wechsler Memory Scale (WMS), no additional studies have come to the writer’s 
attention. The aim of the present study was two-fold: (1) to determine whether the 
utility of the WMS could be improved, and (2) to cross-validate, to the degree poss- 
ible, the results of the initial investigation. 


PROCEDURE 


In the present study, prediction of “organic” involvement was based upon data
obtained from an analysis of 126 cases. Half of these cases were diagnosed both as
“psychotic” and “organic”; the other half were diagnosed as “psychotic” only. The
former group included 35 paretics used in the initial study; omitted were ten
encephalitics and 13 epileptics also employed at that time, since the WMS provided
statistically insignificant differentiation between them and their corresponding
“non-organic” controls. The group of 63 patients who carried a psychotic diagnosis
only was composed mainly of patients who had served as controls in the initial
investigation.


¹Acknowledgment is made to Dr. Geo. A. Kelly for his advice regarding statistical matters.
²From the Veterans Administration Hospital, Chillicothe, Ohio.

Thus data of 63 “organics” and 63 controls were available for pilot study.
Twelve hypotheses resulted. Chi-square values for all but one of these were
significant beyond the 1% level; the level of the exception was beyond 5%. The 12
hypotheses were drawn from the following 18 critical scores, lettered for convenient
reference to Table 1, which cites the items comprising each hypothesis tested in this
study and the corresponding chi-square values previously found in the pilot study.


a. Age incorrect
b. Alphabet below 3
c. Counting by threes below 2
d. Digits Total below 8
e. Digits Total below 9
f. Governor incorrect³
g. Memory Passages Average below 2
h. Memory Quotient below 75
i. Mental Control below 2
j. Orientation below 2
k. Personal and Current Information below 4⁴
l. Personal and Current Information below 5⁴
m. Personal and Current Information plus Orientation, below 7
n. Personal and Current Information plus Orientation, below 9
o. Visual Reproduction above 7
p. Visual Reproduction below 4
q. Visual Reproduction below 6
r. Visual Reproduction “C-1” scored zero


TABLE 1. COMPOSITION, AND PILOT RESULTS, OF 12 HYPOTHESES*

Hypothesis    Items                                   χ²†

 1            d, h, l, q                              14.58
 2            e, h, l, q                              21.68
 3            d, f, h, l, p, r                         8.58
 4‡           d, f, h, l, p, r                        18.51
 5            e, f, k                                 17.24
 6§           a, b, c, d, f, g, h, l, n, p, r         37.85
 7            d                                       22.82
 8                                                    25.74
 9                                                    28.72
10                                                     8.23
11            i                                        7.57
12            j                                        4.31

*.05 = 3.84; .02 = 5.41; .01 = 6.64.
†Correction for small frequencies was applied where a cell entry was below five (2).
‡Hypothesis 4 differed from Hypothesis 3 by its inclusion of persons with any five of the six
items listed.
§Here the criterion was six or more of the 11 items listed.


A new sample of 50 patients, evenly divided between “psychotic-organic”
(experimental) and “psychotic” (control) persons, was selected. In almost every
instance, individual matching was accomplished in respect to both age and duration
of hospitalization (computed from date of most recent admission). The available
educational data on 17 experimentals and 21 controls indicated adequate (group)
matching on this variable.⁵ In a departure from the original study, no attempt was
made to scale occupational achievements. Experimental and control group mean ages
were 53 and 51, respectively; corresponding hospitalization terms were eight years,
two months and seven years, eleven months.


³As in the original study (3), this item was credited if the named Governor of the patient’s home
state was in office during the current [illegible].

⁴Here, too, the procedure of the initial research was followed. Item six (Mayor) was excluded from
the scoring due to its inapplicability, the subtest score becoming prorated.

Not only did each member of the experimental group possess an organic diagno- 
sis, but also most of them were reported to have had positive neurological findings 
upon latest examination. Each member of the control group possessed solely a 
psychotic diagnosis and, in addition, ‘“‘negative” or “essentially negative” neurologi- 
cal findings. Composition of the groups was as follows: 

Experimental                               Control

 3 General arteriosclerosis⁶                2 Catatonic schizophrenia
 1 Endemic encephalitis                    15 Hebephrenic schizophrenia
 1 Lethargic encephalitis                   1 Manic-depressive
 1 Toxic encephalopathy                     1 Mixed schizophrenia
 2 Traumatic encephalopathy                 4 Paranoid schizophrenia
 6 Prefrontal leucotomy⁷                    1 Paranoid and mixed schizophrenia
 1 Multiple sclerosis                       1 Schizoid personality
 1 Parkinson’s disease
 8 Syphilis, central nervous system
 1 Topectomy


RESULTS 

Each hypothesis was subjected to two chi-square analyses. The first was
designed to determine whether the predictions based upon the earlier sample of 126
subjects would hold at similar levels of confidence in the current sample of 50 persons.
Chi-square in the second instance was employed with reference to the null
hypothesis. The values resulting from both analyses are presented in Table 2. The a priori
hypotheses included the expectation that insignificant differences would result from
comparison of the two samples mentioned above. From Table 2 it will be noted that
only the chi-square values for a priori hypotheses 3, 10 and 12 are low enough to
meet the criterion of insignificance. Except in one instance (Hypothesis 4), the
remaining values are well above the 1% level of confidence. Apparently, therefore, the
present sample of 50 subjects differs markedly from the original sample in terms of
most of the hypotheses advanced.


TABLE 2. CHI-SQUARE VALUES UNDER A PRIORI AND NULL CONDITIONS

Hypothesis    A priori    Null*

 1            14.81        .46
 2            17.59        .09
 3              .004
 4             6.33
 5            19.23
 6            24.35
 7            44.41
 8            22.64
 9            13.69
10             2.84
11             7.59
12             2.79

*.05 = 3.84; .02 = 5.41; .01 = 6.64.


⁵Experimental group median grade completed was nine; control median was eight.
⁶Two patients had additional diagnoses of cerebral arteriosclerosis.
⁷One patient had an additional diagnosis of general arteriosclerosis.

Consulting the “null” column of Table 2, we note that Hypothesis 3 holds at 
better than the 5% level and Hypotheses 6 and 9 hold beyond the 10% level of con- 
fidence. Lower confidence levels are noted for the remaining hypotheses. 
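
The reading of the a priori column can be retraced with a few lines of Python. This is a sketch rather than the author's procedure: it assumes one degree of freedom (which is what the tabled critical values 3.84, 5.41 and 6.64 correspond to), takes the a priori values as printed in Table 2, and uses hypothetical function names.

```python
from math import erfc, sqrt

# A priori chi-square values as printed in Table 2, keyed by hypothesis number.
A_PRIORI = {1: 14.81, 2: 17.59, 3: 0.004, 4: 6.33, 5: 19.23, 6: 24.35,
            7: 44.41, 8: 22.64, 9: 13.69, 10: 2.84, 11: 7.59, 12: 2.79}

CRITICAL_05 = 3.84  # 5% level, 1 degree of freedom
CRITICAL_01 = 6.64  # 1% level, 1 degree of freedom


def p_value_df1(chi_square: float) -> float:
    """Upper-tail probability of chi-square with 1 degree of freedom (assumed)."""
    return erfc(sqrt(chi_square / 2.0))


def below(values: dict, critical: float) -> list:
    """Hypothesis numbers whose chi-square falls short of the critical value."""
    return sorted(h for h, x in values.items() if x < critical)


insignificant = below(A_PRIORI, CRITICAL_05)
between_05_and_01 = [h for h in below(A_PRIORI, CRITICAL_01)
                     if h not in insignificant]
print(insignificant, between_05_and_01)
```

With these inputs the first list contains Hypotheses 3, 10 and 12 and the second contains only Hypothesis 4, which matches the description given in the Results.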


DISCUSSION


As with most (if indeed not all) psychological techniques currently employed for
diagnosing the “organic,” the WMS should be used not only with extreme caution,
but also jointly with a continuing program of re-evaluation. It must be remembered
that this scale emphasizes “memory” and, for the most part, “recent memory.”

It was pointed out that three hypotheses were supported at the a priori levels
(Table 2); however, two of these three hypotheses did not stand up against the null
hypothesis. Thus the findings in respect to Hypotheses 10 and 12 suggest that
although the performance of the current sample is similar to that of its predecessor,
the results do not permit rejecting the probability of a chance relationship. On the
other hand, two hypotheses, although not supported at the a priori levels, “tended”
to approach an acceptable degree of significance when tested against chance.
Therefore the findings related to Hypotheses 6 and 9 suggest a difference between samples,
but sustain a tendency to afford above-chance prediction.

In one instance (Hypothesis 3) supportive results were found both at the a priori
and null levels. Accordingly, the six “signs” comprising this hypothesis should
afford the best single predictor of brain damage.

If, with WMS results, the clinician encounters difficulty recognizing the “chronic
organic,” how will he fare with the recently brain damaged individual? May we
reasonably expect that increasing chronicity is paralleled by increasingly impaired
performance? Or does increased hospitalization decrease the difference in
performance between the “organic” and his “non-organic” counterpart? Does
hospitalization affect the latter so that his performance becomes less distinguishable from that
of the brain damaged individual? These are some of the possibilities associated with
use of the WMS and similar techniques. Another problem is the “kind” of patient
selected. Wechsler“ has said that the WMS “‘. . . should be useful in detecting 
special memory defects in individuals with specific organic brain injuries... .” Per- 
haps our sample inadequately meets this criterion. Additional studies, using patients 
differently diagnosed, would shed further light on this problem. Of two patients with 
the diagnosis of “traumatic encephalopathy,” one was selected by ten hypotheses; 
the other was not selected by any hypothesis. Cohen“ was unable to distinguish 
a group of variously diagnosed “organics” from groups of “psychoneurotics” and
“schizophrenics” on 15 WMS variables. Continuing evaluation of the WMS as a
diagnostic instrument is indicated.


SUMMARY 


1. A further investigation of the utility of the Wechsler Memory Scale as a
diagnostic tool is reported. Some comparisons are drawn with an earlier study by the
author.

2. Twelve hypotheses designed to select persons with “organic brain damage” 
are cited. One of these indicates a satisfactory degree of predictive efficiency. Sev- 
eral of the remainder require additional study. 

3. It is suggested that the Wechsler Memory Scale be subjected to a contin- 
uing program of re-evaluation to define its usefulness in individual diagnosis. 
RELATIONSHIP BETWEEN DIAGNOSIS OF PSYCHOSEXUAL
PATHOLOGY AND THE SEX OF THE FIRST DRAWN PERSON


EMANUEL F. HAMMER, PH.D. 
Research Project, Psychiatric Institute, New York¹


PROBLEM 


Perhaps the most definitive statements made in the area of projective drawings 
are those concerning the interpretations gleaned from the sex of the first drawn 
Person. These interpretations have their basic roots in Schilder’s theory of the body 
image. In accord with this theory, the self-concept is said to be confused in areas of 
psychosexual identification when a subject draws an opposite-sex Person in response 
to the request “Please draw a Person”, since the direction remains ambiguous in
regard to the sex of the drawn person requested. 

Machover writes, “From the standpoint of sexual identification, it is assumed
to be most normal to draw the self-sex first. From an empirical point of view, it is 
of interest that evidence of some degree of sexual inversion was contained in records 
of all individuals who drew the opposite sex first in response to the instruction, 
‘Draw a Person’.’’®: P. 1%) In spite of the readiness with which many clinicians who 
use projective drawings employ the interpretation of confused psychosexual
identification and/or homoerotic orientation when a subject draws the opposite-sex figure
first, experimental or even large-scale empirical support for this interpretation is
still lacking.

Evidence suggesting that confused sexual identification cannot be uncritically 
assumed from the drawing of a male Person first by a female subject is to be found 
in Mainord’s® data consisting of drawings from an unselected mental institution 
population in the same geographic area as the college students she employed as sub- 
jects. Whereas 57% of her female college subjects drew the female Person first, a 
slightly greater number (61%) of the institutionalized females drew the self-sex 
figure first. Since clinical experience suggests that sexual inversion is higher in dis- 
turbed individuals, the lack of any higher incidence of institutionalized subjects over 
“normal” subjects drawing the opposite sex figure first casts a shadow of doubt over 
the traditional interpretation made from such projective drawing behavior. 

In the area of drawings of male patients, Levy “ reports that it is usual for a 
selected group of homosexuals to draw the opposite sex figure first. His sample, how- 
ever, is a rather small one. Thirteen out of 16 overt male homosexuals drew the 
female Person first. Levy’s findings are apparently contradicted by Barker, Mathis 
and Powers“? who employed a larger sample (50 overt homosexuals). Forty-six of 
the 50 homosexuals drew the male Person first. Thus, the initial drawing of the 
female figure is not characteristic in the case of male homosexuals in an army
environment.²

Since the evidence is rather conflicting that the sex of the first drawn Person 
might be used as an index of a subject’s sexual identification, the present study was 
designed to investigate the performance of 84 sex offenders currently imprisoned at 
Sing Sing Prison. The gross psychopathology and confusion of sexual identification
expected in sex offenders would make this a fruitful group with which to investigate
the hypothesis of sexual inversion and/or sex identification confusion reflected in the
drawing of opposite-sex figures. 


¹Part of a study set up under the auspices of the New York State Commissioners of Correction
and Mental Hygiene upon the recommendation of Governor Dewey. The purpose of the Research 
Project is to try to uncover the underlying dynamic causes of “sex crimes” and to ascertain the treat- 
ability of sex offenders. This task was undertaken at the New York State Psychiatric Institute and 
is under the directorship of Bernard C. Glueck, Jr., M. D. Grateful acknowledgment is made to John 
N. Buck, Bernard C. Glueck, Jr., Zygmunt A. Piotrowski and Selma Landisberg for their encourage- 
ment and fruitful comments. 

²Distortion of the female figure, however, differentiated the homosexuals from a control group.
In addition, 23 of the 38 homosexual subjects who distorted female figures gave spontaneous hostile
verbalizations. “The distortion of the female figure, when evaluated in terms of the spontaneous
hostile verbalizations . . . , reflects a basic hostility to the opposite sex, with whom the homosexual
(presumably) is in competition.” @: ». 187) 
SUBJECTS


Eighty-four sex offenders served as subjects for the present investigation. 
Thirty-one consecutively admitted inmates at Sing Sing convicted of the offense of 
rape of an adult female comprised one sub-group, 33 consecutively admitted inmates 
convicted of sex offenses involving a female child under 15.5 years of age constituted 
the second sub-group, and 20 consecutively admitted individuals convicted of
homosexual offenses involving a male child under 15.5 years of age constituted the
remaining sub-group. Thus, all the men studied were currently “sex offenders” in
prison for an overt sexual act defined as illegal by current New York State laws.
The rapists’ ages ranged from 24 to 60 with a mean of 33.6, the heterosexual pedo- 
philes’ ages ranged from 23 to 63 with a mean of 34.3, and the homosexual pedo- 
philes’ ages ranged from 23 to 65 with a mean of 33.9. Close to 50% of the three 
sub-groups were recidivists. Inmates who had previous sexual offenses of a type 
other than the one for which they were presently serving time were not included in 
the subject groups. 


PROCEDURE 

The portion of the H-T-P’s under focus in the present study was administered
as part of a complete psychological examination customarily given to the research
project population at Sing Sing Prison. Each subject was requested to draw a
House, a Tree, a Person and then a Person of the sex opposite to that drawn
immediately prior. A separate sheet of 8½ by 7 inch paper was provided for each
drawing.

The three sub-groups of sex offenders (rapists, heterosexual pedophiles and
homosexual pedophiles) were then compared by means of Fisher’s t test on the
variable of opposite-sex-Person-drawn-first.


RESULTS 


Four of the 31 rapists drew the female Person first; 4 of the 33 heterosexual 
pedophiles and 5 of the 20 homosexual pedophiles did likewise. Thus, 87.1% of the 
rapists, 87.9% of the heterosexual pedophiles, and 75.0% of the homosexual pedo- 
philes, drew the male Person first. Three-way comparisons revealed no significant 
differences among these sub-groups (see Table 1), although the overtly homosexual 
sub-group may tend somewhat toward greater frequency of female figure drawn first. 
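
The percentages above follow directly from the reported counts, and the pairwise comparisons can be approximated as follows. This is only a sketch: the exact form of the t test used in the study is not given in the text, so a pooled critical ratio for the difference between two proportions is assumed here, and the function names are hypothetical.

```python
from math import sqrt

# (number drawing the female Person first, group size), as reported in the text.
GROUPS = {
    "rapists": (4, 31),
    "heterosexual pedophiles": (4, 33),
    "homosexual pedophiles": (5, 20),
}


def percent_male_first(female_first: int, n: int) -> float:
    """Percentage of a group drawing the male Person first."""
    return round(100 * (n - female_first) / n, 1)


def critical_ratio(x1: int, n1: int, x2: int, n2: int) -> float:
    """Difference between two proportions divided by its pooled standard error."""
    pooled = (x1 + x2) / (n1 + n2)
    standard_error = sqrt(pooled * (1 - pooled) * (1 / n1 + 1 / n2))
    return (x1 / n1 - x2 / n2) / standard_error


percentages = {name: percent_male_first(*counts) for name, counts in GROUPS.items()}
names = list(GROUPS)
ratios = {
    (a, b): critical_ratio(*GROUPS[a], *GROUPS[b])
    for i, a in enumerate(names)
    for b in names[i + 1:]
}
print(percentages)
print({pair: round(value, 2) for pair, value in ratios.items()})
```

Under this assumed test none of the three ratios reaches 1.96 in absolute value, which is in line with the reported absence of significant differences among the sub-groups.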


TABLE 1. COMPARISON OF INCIDENCE OF FEMALE PERSON DRAWN FIRST

                               Number Drawing    Percentage Drawing
Subject Group                  Female First      Female First

Rapists vs.                          4                 12.9
Heterosexual Pedophiles              4                 12.1

Rapists vs.                          4                 12.9
Homosexual Pedophiles                5                 25.0

Heterosexual vs.                     4                 12.1
Homosexual Pedophiles                5                 25.0


DISCUSSION


In a sense, the rapists may serve as a control group for the pedophiles in regard
to such factors as the experience of having been detected, tried, convicted and
incarcerated for a sex offense as well as living in a more or less common prison
environment at the time of testing. Pascal and Herzberg have repeatedly found a lack
of differentiation between rapists and a “normal” control group on various
variables, concluding that “the controls and rapists are indistinguishable from each
other” (p. 8). In explanation they state: “If we take heterosexual behavior
between adults as our standard, then it is difficult to see how the rapists differ from the
controls in sexual behavior per se.” The rapists probably do differ from the
pedophiles, however, on the variable of aggression, the majority of the pedophilic
acts having been of a passive and seductive nature. Despite this difference, the
rapists remain the best contrast group available for a study of other types of “sex
offenders”.

The present study shows no statistically significant difference between the 
pedophile groups and the rapist-control group in regard to which sex figure was 
drawn first in spite of the massive psychopathology and confusion of sexual
identification one would expect in a group of pedophiles.³


SUMMARY 


H-T-P’s were administered to 84 sex offenders at Sing Sing Prison, 31 of them 
incarcerated for an act of rape of an adult female, 33 for sexual advances toward 
female children, and 20 for sexual advances toward male children. After drawing a 
House and a Tree, the subject was requested to draw a Person, and then a Person of 
the sex opposite to that drawn immediately prior. Comparisons of the three sub- 
groups of sex offenders were made in regard to the percentage drawing the opposite- 
sex figure first in response to the ambiguous request to draw a Person. 

On the basis of the relatively small number of subjects employed, the following 
tentative conclusion appears to be suggested. Since only a small percentage of the 
psychosexually deviant groups drew a female Person first and, secondly, since the t 
test for this factor could not differentiate between the three kinds of sex offenders 
studied (including overtly homosexual and heterosexual groups), considerable doubt 
is cast on the projective drawing postulate that the sex of the first figure drawn may 
serve as an index of the subject’s sexual identification or as evidence of psychosexual 
conflicts or sexual inversion.⁴

³As an alternative hypothesis in regard to the meaning of the first drawn Person, Piotrowski
suggests that the opposite sex figures “express, in free drawings as well as on the TAT and on the
Rorschach human movements, personality traits which are not fully acceptable to the individual while
the same-sex figure reflects the more accepted personality traits which are not so likely to cause anxiety
or feelings of inferiority.” Piotrowski then relates the choice of the sex of the figure drawn to the degree
to which the subject feels free in revealing his inner traits, conflicts and feelings.

⁴Since the preparation of this paper, the writer has received the following personal communica-
tion from Buck, underscoring this view: “One would no more be justified in according an index-of- 
sexual-identification status to the sex of the first-drawn Person than one would be justified in granting 
similar paramount status to any other manifest (as opposed to latent) content item. Certainly it is 
not so much that the subject draws a man or a woman first, as it is how he draws it, the values he 
assigns to its component parts and their relationship, and the value that its content has for him.” 





AN EMPIRICAL STUDY OF THE ETIOLOGY OF DENTAL FEARS 
EDWARD JOSEPH SHOBEN, JR., PH.D. LOREN BORLAND, D.D.S. 
Teachers College, Columbia University Berkeley, Calif. 


PROBLEM 


This study is a result of the widespread and repeated observation that dental 
patients respond affectively in quite varied ways to the dental situation. Essentially 
the same dental procedures, carried out under essentially the same objective condi- 
tions, give rise to emotional responses ranging from something like near enjoyment 
in a few cases to intense discomfort and fear in many others. The practical con- 
sequences of this fearfulness in large numbers of people are two: The fearful patient 
(a) is not able to cooperate fully in treatment, and (b) tends to avoid dental care 
of the most routine sort, thereby permitting the development of otherwise prevent- 
able tooth and gum pathology. 

In spite of its importance in both preventive and therapeutic dentistry, the 
factor of emotionality has been strikingly neglected in the literature. Wile
recognized the problem over twenty years ago and offered various techniques of
reassurance by which to deal with it; but he offered no information as to what the origins
of dental fear might be, nor did he base his recommendations on anything resembling
careful study. Richardson®? in 1936 discussed fearfulness in dental patients, draw- 
ing up the “laws of Fear” and suggesting that if these were studied and applied to 
professional practice, the problem would disappear. He did not specify, however, 
just what his “laws” were or how they were derived. 

It remained for Ryan“? to recognize fully and to give articulate voice to the 
problem of dental fears, pointing out that the emotional reactions of the patient 
which militate against effective treatment are best conceived as functions of the 
patient’s personality as it has been determined by his experience. Further, he stress- 
ed the need for controlled study rather than an intuitionistic, cut-and-try approach. 
The current investigation is in part a response to this challenge. It represents re- 
search of a preliminary and exploratory kind only, however, and may be useful 
primarily in pointing the way toward more definitive studies. 

Specifically, it was assumed that dental fears are acquired rather than innate, 
and a number of hypotheses were generated on the basis of various rationales as to 
the types of experience or personal characteristics that might be significantly asso- 
ciated with such fears. Four classes of hypothesis were developed: (a) those having 
to do with pain tolerance; (b) those having to do with traumatic experiences; (c) 
those having to do with parental attitudes and family background; and (d) those 
having to do with certain aspects of personality. The precise hypotheses are as fol- 
lows: 


1. Fearful dental patients have a lower general pain tolerance than do 
nonfearful patients. 

2. Fearful dental patients have undergone traumatic dental experience 
more frequently than have nonfearful patients. 

3. Fearful dental patients have undergone traumatic medical experiences
more frequently than have nonfearful patients. The reasoning here simply 
holds that there is a kind of equivalence between dentists and physicians for 
many individuals in American culture and that by generalization®) acquired 
fears of physicians and the medical situation may be evoked by dentists and 
the dental situation. 

4. Fearful dental patients have suffered severe facial injuries more fre- 
quently than have nonfearful patients. 

5. Fearful dental patients come from families whose dental experience 
has been unfavorable. 

6. Fearful dental patients come from families in which unfavorable
attitudes toward dental work are typically expressed.






7. Fearful dental patients are characteristically more anxious than non- 
fearful patients. Here the basic proposition is that the in-part inherent and the 
certainly widely talked about painfulness of the dental situation facilitates the 
cathexis of anxiety with it. 

8. Fearful dental patients have greater difficulty with authority figures 
than do nonfearful patients. The assumption underlying this hypothesis is that 
dental care involves the patient’s submitting to the manipulations of an author- 
ity. For those whose relationships to authority are marked with conflict, such 
a situation may elicit hostility and the very fearfulness and uncooperativeness 
of which dentists complain. 

9. Fearful dental patients tend to attach greater importance to personal 
appearance than do nonfearful patients. 

10. Fearful dental patients tend to place a higher value on oral activity 
than do nonfearful patients. From psychoanalytic theory? it follows that 
individuals fixated at an oral level and to whom the oral zone is particularly im- 
portant in their personality economy may unconsciously regard dental pro- 
cedures as a form of attack and therefore respond with anxiety. 

11. Fearful dental patients tend to be more dependent than nonfearful 
patients. Allied to Hypothesis 10, this one derives from psychoanalytic notions 
about the relationship of oral fixation and dependency. 


It can readily be seen that these hypotheses are not independent and that they 
are stated in crude, broad terms. They serve, however, as guides to a first approach 
to the investigation of a problem that seems psychologically important both theoret- 
ically in psychosomatic medicine and practically in dental practice. 


PROCEDURE 


The subjects consisted of two groups of patients drawn on a volunteer basis 
from a private dental practice in a midwestern city of moderate size. The “fearful” 
group was composed of fifteen patients who displayed extremely intense emotional 
patterns in the dental office, reported themselves as quite afraid of the dental situa- 
tion, and in thirteen instances asserted that they had delayed getting dental at- 
tention because of their fear of the prospect. The “non-fearful” or control group 
was made up of fifteen patients who exhibited little or no objectively observable signs 
of emotional distress in the office, denied fear, and in no case admitted to having
delayed making appointments when they were thought advisable. The purpose of the
investigation was explained to the subjects, and it was made clear that they would be asked
to give a good deal of rather intimate information about themselves which would be 
kept in strictest confidence and which would not in any way prejudice their status 
with the dentist. They were also told that their cooperation was being sought to 
help clarify an important problem but that they were under no duress whatsoever 
to participate. Every effort was made to avoid pressure in soliciting the subjects. 
Each group contained nine women and six men. The age range was from 19 to 38 
in the fearful group and from 18 to 41 in the controls. In both groups the median 
educational level was 14 years. 

Each subject was given an intensive two-hour interview, designed to elicit in- 
formation relevant to the areas indicated in the hypotheses. The interview was of a 
semistructured sort, with the interviewer raising topics and pursuing them within 
the framework of the purposes of the study, but allowing the subject freedom to 
elaborate on his responses and to discuss other material than that asked for. It was 
felt that cooperation was excellent and that the information gained was essentially 
reliable in spite of its being acquired in a single interview. No formal check on re- 
liability, however, was made. 

In every case, the subjects discussed at some length their reactions to painful 
episodes in their histories, their relationships with professional people with special 
reference to dentists and physicians, and their recollections of instances of dental 
treatment involving their parents or siblings. They were urged to keep this matter 
of remembered events separate from another topic, their perception of parental
attitudes toward dentists and dentistry. In addition, subjects were queried about their
general fears and anxieties and worries; about their conflicts with teachers, em- 
ployers, military and civil authorities, and other authority figures; and about their 
emphasis on dress, grooming, cosmetic care, and other matters of appearance. Finally,
information was sought regarding the degree of pleasure gained from eating, drinking,
smoking, talking, and using the mouth and oral area in sexual activities; and
time was given to discussion of the extent of reliance placed by subjects on their 
parents, spouses, or other protective figures. Even in these more charged areas of 
investigation, the subjects responded with a surprising frankness. 

Near-verbatim transcripts of the interviews were coded to conceal the identity 
of the subject and were then reviewed by two psychologically sophisticated judges 
who were asked to rate each case either “yes” or “no” with respect to each of the
eleven factors named in the hypotheses. This procedure, of course, is tantamount to 
rating on a two-point scale or asking the judges to determine whether or not each 
subject could properly be characterized as having, for example, low pain tolerance, 
or a history of traumatic dental experience, or the experience of being exposed to 
unfavorable family attitudes toward dentistry, or a high anxiety level. This method 
of assessment is certainly crude, but it was felt that there was no point in attempting 
to refine the judgmental scale at this stage of preliminary research because of the 
small number of cases used, the generality of the hypotheses, and the qualitative 
nature of the data obtained. 

When the two sets of independent ratings were compared, disagreements occurred
in only slightly over five per cent of the 330 judgments made. In these eighteen
instances, the judgments were simply omitted from the computations.
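
The agreement figures can be checked with a little arithmetic. The sketch below assumes that each of the 30 subjects received one paired judgment on each of the eleven factors, and that the eighteen omitted judgments are the ones on which the judges disagreed; neither point is stated outright in the text.

```python
# Rough check of the reported agreement figures (assumptions noted in the lead-in).
subjects = 15 + 15          # fearful group plus control group
factors = 11                # one yes/no judgment per hypothesized factor
judgments = subjects * factors

disagreements = 18          # judgments omitted from the computations
disagreement_rate = disagreements / judgments
agreement_rate = 1 - disagreement_rate

print(judgments)                          # 330
print(round(100 * disagreement_rate, 2))  # percentage of disputed judgments
```

Under these assumptions the count of 330 judgments and the figure of slightly over five per cent are mutually consistent.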


RESULTS 


The results are presented in Table 1. The chi-square test, corrected for discontinuity,
was used to test each of the eleven hypotheses. With the .05 level of confidence
accepted as establishing significance, only hypotheses 5 and 6 appear to be
confirmed.¹ Both of these propositions have to do with family attitudes toward dental
treatment. Since one would expect these two predictions to be correlated, it is not
surprising that they yield similar values of chi-square, but it is important to note
that their both being significant lends weight to the notion that experience within
one’s family is highly important in determining whether or not one will react fearfully
to dental procedures.


TABLE 1. SUMMARY OF FACTORS ASSOCIATED WITH DENTAL PATIENTS’ FEARS

                                                Non-Fearful      Fearful        Chi
Factor                                          Yes     No      Yes     No     Square

Low Pain Tolerance                                1     13        7      8      2.35
Traumatic Dental Experience                      11                             3.34
Traumatic Medical Experience                                                    3.75
Traumatic Facial Experience
Unfavorable Family Dental Experience
Unfavorable Family Attitude Toward Dentistry
High Anxiety Level
Trouble with Authority
Emphasis on Appearance
Emphasis on Orality
Dependency
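
The test described above can be sketched as follows. This is a minimal illustration of a 2 x 2 chi-square with the correction for discontinuity (Yates' correction), not the author's own computation; the function name and the counts in the example are hypothetical and are not taken from Table 1.

```python
def chi_square_yates(a, b, c, d):
    """Chi-square for the 2x2 table [[a, b], [c, d]] with Yates' continuity correction."""
    n = a + b + c + d
    row1, row2 = a + b, c + d
    col1, col2 = a + c, b + d
    if min(row1, row2, col1, col2) == 0:
        raise ValueError("every row and column total must be positive")
    # The correction shrinks |ad - bc| by n/2, but never below zero.
    diff = max(abs(a * d - b * c) - n / 2, 0.0)
    return n * diff ** 2 / (row1 * row2 * col1 * col2)


# Tabled chi-square value for the .05 level with one degree of freedom.
CRITICAL_05_DF1 = 3.841

# Hypothetical counts: 12 of 15 fearful and 4 of 15 non-fearful patients rated "yes".
value = chi_square_yates(12, 3, 4, 11)
print(round(value, 2), value > CRITICAL_05_DF1)
```

A factor is taken as confirmed at the .05 level only when the corrected value exceeds 3.841.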


DISCUSSION 


In the light of these findings, it seems legitimate to conclude that the attitudes
and experiences of one’s family in relation to dentistry are a most important
factor in determining whether an individual will react with anxiety to the prospect
of dental treatment and will therefore tend to avoid oral care for a detrimentally 
long period or to be emotionally disturbed and uncooperative in the chair. People 
come to the dentist set to respond with tension and fear chiefly because of the way 
dentistry has been represented to them in their homes. 

There may be an important relationship between this relatively clear-cut con- 
clusion and the rejected hypotheses dealing with traumatic experiences. What may 
well happen is that traumatic experiences involving the facial area or with physicians, 
as well as directly with dentists, may emotionally sensitize individuals who then as 
parents convey their own fears to their children through the amorphous but effective 
emotional learning situations the home provides. 

In any event, it seems likely that for the present generation of dental patients, 
at least, the problem of dental fear is not specific to the dental situation. Rather, it 
is closely bound up with the attitudinal transmission of anxiety through the child’s 
interactions with significant figures in his social environment. 


SUMMARY 


When the interview data of a group of fearful dental patients are compared with 
respect to eleven hypotheses with those of a control group of nonfearful patients, the 
significant element in the etiology of dental fears appears to be the attitudes of the 
patient’s family. Rather than being specific to the dental situation, dental fears 
seem to have their source in the interactions of the developing individual with his 
parents and other significant persons.


¹It is quite possible that these two significant chi-squares could have been obtained by chance
alone out of the series of eleven tests run. On the other hand, these predictions were made in advance
on quite rational grounds, and the factors making for unreliability in this exploratory study are such
as to obscure any but the most stable trends in the data.
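
The concern raised in this footnote can be given a rough size. The sketch below assumes, purely for illustration, that the eleven tests are independent and that each has a .05 chance of a spurious significant result; the text itself notes that hypotheses 5 and 6 are probably correlated, so the figures are only an approximation.

```python
from math import comb


def prob_at_least(k, n_tests, alpha):
    """Probability of k or more chance 'significant' results among independent tests."""
    return sum(
        comb(n_tests, j) * alpha ** j * (1 - alpha) ** (n_tests - j)
        for j in range(k, n_tests + 1)
    )


p_one_or_more = prob_at_least(1, 11, 0.05)
p_two_or_more = prob_at_least(2, 11, 0.05)
print(round(p_one_or_more, 3), round(p_two_or_more, 3))
```

Under these assumptions, at least one chance result among eleven tests is fairly likely, while two or more is a good deal less so.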





SOME CHARACTERISTICS OF THE MANIFEST DREAM CONTENT OF 
MENTAL DEFECTIVES* 


MANFRED F. DE MARTINO 
Southbury Training School 


INTRODUCTION 


A review of the literature discloses extremely little research on the dreams of 
mental defectives. Walsh and Sarason appear to be the only investigators who
have given this problem any serious consideration. This paucity of research may be 
due to the rather general attitude that the dreams of defectives offer very little as 
an aid in psychodiagnosis or psychotherapy. Extensive clinical experience with 
mental defectives, however, has led the writer to question the validity of this belief. 
For while the mental level of the defective may, in all likelihood, limit the scope of 
his dream life, it would seem that his lack of sophistication and relatively simple 
thought processes should result in dreams which are open, direct and easily inter- 
preted. Assuming this to be true, the dream life of the mental defective could offer 
another effective means of revealing his basic feelings, desires, interests, fears, 
frustrations, etc. The present study, therefore, was undertaken in an attempt to
investigate the nature of the dreams of a group of institutionalized mental defectives.


METHOD 


The subjects consisted of 50 defective males and 50 defective females at the 
Southbury Training School, ranging in age from 14 to 18 years. The mean age of
the males was 16.5, and the mean age of the females was 15.5. The IQ’s of each
group ranged from 55 to 72. The mean IQ was 64 for the males and 65 for the females.
All IQ’s were based on results from the Revised Stanford-Binet, Wechsler-Bellevue
Intelligence Scale for Adolescents and Adults (Form 1), and the Wechsler In- 
telligence Scale for Children. The participants were composed of those diagnosed as 
“Familial” and “Undifferentiated.” Epileptics, spastics, psychopaths, and individ- 
uals considered to be severely disturbed, were excluded. Each of the 100 subjects 
was personally interviewed by the writer. An additional ten subjects were inter- 
viewed but are not included in this study as they reported never having dreamed. 
During the interview, after a sufficient degree of rapport had been established, each 
subject was asked if he knew what a dream was. Those who responded satisfactorily 
and indicated that they dreamed were then asked to tell what they dreamed about 
most often. In a very small percentage of the cases, encouragement and reassurance
by the investigator were necessary in order to relieve the subject’s feelings of anxiety
and embarrassment. After the participant had fully completed his self-directed 
dream reporting, he was asked to respond to a questionnaire. This was done in an 
effort to elicit certain aspects of dreaming which the subject might have overlooked 
in his initial reporting, or for some reason or other might have been reluctant to 
reveal. Whenever the respondent did not seem to fully understand the meaning of 
the question, time was taken to present an explanation which it was felt would be 
understood. Because of the limited capabilities of these subjects, they were simply 
asked to respond to the questions in terms of ‘yes’ or ‘no’, rather than in terms of 
frequency (See 1). 

The data obtained from the self-directed dream reports were treated in the
following manner: Based on an overall observation of the collected data, various
categories were established such as “Going Home”, “Home”, “Father”, “Mother”,
“Getting Out”, “Animals”, “Murder”, “Sports”, “Activities”, etc. The reports were
then analyzed in terms of their specific content. For example, each time a reference
was made regarding one’s father, a tally was recorded under the category “Father”.
Isolated and unusual responses were placed under a category labelled “Miscellaneous”.
In presenting the results, essentially similar categories were combined and
treated as one; e.g., “Home, Going Home” (see Table 1). After the data were
assembled, chi-square values were computed to determine the significance of sex
differences.


*The author wishes to thank Dr. C. Edward Stull, Director of Psychological Service, Southbury
Training School, Southbury, Connecticut, for his invaluable aid in the statistical computations and
general experimental design.


QUESTIONNAIRE*

 1. Do you ever see colors in your dreams, e.g., red, green, yellow, blue?
 2. Do you ever have a dream in which you are falling, or you fall off of something?
 3. Do you ever have very bad dreams that make you feel afraid?
 4. Do you ever have a dream in which somebody or something is chasing you?
 5. Do you ever have a dream in which you are flying, e.g., in a plane, through the air?
 6. Do you ever dream about the same thing?
 7. Do you ever dream about finding money?
 8. Do you ever dream about dying, or that somebody died?
 9. Do you ever dream about running away from here?
10. Do you ever dream about eating food?
11. Do you ever dream about hitting somebody?
12. Do you ever dream about girls?
13. Do you ever dream about boys?
14. Are most of your dreams good or bad?
15. Do you remember most of your dreams, or do you forget most of them?


*The wording of the questions varied somewhat with the functioning intelligence of the subject.


RESULTS 


Table 1 contains those dream categories which included the responses of one-third
or more of either sex. Chi-square values and levels of confidence, as well as
the number of responses in each major category, are also indicated. An examination
of this table reveals the following: (a) More of both sexes report dreaming about
“Home” and “Going Home” than any other motif. (b) The great majority of
both sexes report dreaming about their parents. (c) Dreams about “Getting Out”
and “Placement” are reported by half of the males and almost half of the females.
(d) Significantly more males than females report dreams of aggression. (e) Significantly
more males than females report dreaming about males. (f) Significantly more
males than females report dreaming about sports activities.


TABLE 1. SEX DIFFERENCES IN MAJOR CATEGORIES*
(BASED ON SELF-DIRECTED DREAM REPORTS)

                              Male (N = 50)    Female (N = 50)
                              Number of        Number of                      Level of
Major Categories              Responses        Responses        Chi-Square    Confidence

Home, Going Home                  42               39              0.58
Mother                                             35              1.33
Father                                             32              2.01
Getting Out, Placement                             21              0.64
Other Family Members                               19              0.17
Girls                                              19              0.40
Aggression                                         11              5.47
Boys                                                8              6.14
Sports Activities                                   0             23.46          .01

*Only those categories are presented which contained the responses of one-third or more of
either sex.


The following dream categories contained fewer than one-third of the responses
of either sex: Animals, Kissing and Love, Job and Working, Television Programs,
Various Personalities, Movies, Cowboys and Indians, Teacher and School, Falling,
Being Chased, Dancing, Church and God, Clothes, Cars.
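
The chi-square values for sex differences can be checked directly from the counts. The sketch below takes the “Home, Going Home” row, where both counts are given (42 of the 50 males and 39 of the 50 females), and assumes an ordinary Pearson chi-square without a continuity correction; the paper does not say which form was used, but this form reproduces the tabled value.

```python
def chi_square_2x2(a, b, c, d):
    """Uncorrected Pearson chi-square for the 2x2 table [[a, b], [c, d]]."""
    n = a + b + c + d
    return n * (a * d - b * c) ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))


group_size = 50
males_yes, females_yes = 42, 39   # "Home, Going Home" row of Table 1

value = chi_square_2x2(
    males_yes, group_size - males_yes,
    females_yes, group_size - females_yes,
)
print(round(value, 2))  # 0.58
```

With one degree of freedom, a value this small falls well short of the 3.84 needed for significance at the .05 level.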

Table 2 contains the number of positive responses (except for questions 14
and 15) to the questionnaire, along with chi-square values and levels of confidence.
From an examination of this table it can be seen that significantly more females
than males report experiencing color in their dreams (question 1), falling
dreams (question 2), and dreams of being chased (question 4). On the other
hand, significantly more males than females report dreaming about finding money
(question 7), food (question 10), and hitting people (question 11).

Table 2 also reveals the following: (a) More than half of both sexes report ex- 
periencing fearful dreams (question 3). (b) Flying dreams which occur rather in- 
frequently are reported by almost the same number of males and females (question 
5). (c) An equal number of males and females report experiencing recurrent dreams 
(question 6). (d) Less than one-third of both sexes report dreaming about death 
(question 8) or running away (question 9). (e) A greater number of males report 
dreaming about females than vice versa (questions 12 and 13). (f) More than three- 
fourths of both sexes report that most of their dreams are predominantly good 
(question 14). (g) More than half of both sexes tend to forget most of their dreams 
(question 15). 


TABLE 2. SEX DIFFERENCES IN RESPONSE TO QUESTIONNAIRE

                                  Male (N = 50)      Female (N = 50)
                                  Number of          Number of
Questions                         ‘Yes’ Responses    ‘Yes’ Responses    Chi-Square

 1. (color)                            14                                  5.09
 2. (falling)                          17                                  4.89
 3. (feel afraid)                      34                                  0.05
 4. (chasing you)                      11                                  5.47
 5. (flying)                           11                                  0.06
 6. (recurrent)                         2                                  0.00
 7. (finding money)                    17                                  6.83
 8. (death)                                                                0.00
 9. (running away)                                                         0.27
10. (eating food)                      34                                 16.35
11. (hitting somebody)                 21                                  4.59
12. (about girls)                      27                                  0.16
13. (about boys)                       24                                  3.40
14. (Good vs. Bad)                     Good 40                             0.06
15. (Remember vs. Forget)              Remember 17      Remember 16





DISCUSSION


Especially noteworthy is our finding that only ten subjects of those interviewed 
reported never having dreamed. A qualitative evaluation of the self-directed dream 
reports tended to denote a relationship between IQ ratings, level of emotional 
maturity, and specific dream content. Subjects with IQ’s at the upper end of the
range tended to report dreams of a somewhat more complex and mature nature than
those with IQ’s at the lower end. Significant, too, was the finding that while the
subject’s IQ was in part related to the ease with which he described his dream, his
personality structure seemed to be of greater importance. In the main, “extraverts”
reported their dreams more readily and with greater detail than did “introverts”.
In view of the overall findings it is strongly felt that one might well utilize the 
dreams of mental defectives to the same advantage and in much the same manner as 
other projective data. This is particularly so in those instances where, either be- 
cause of a low level of functioning intelligence, lack of sufficient imagination, or 
some personality factor, the subject is unable to adequately respond to such devices 
as the Thematic Apperception Test, Rorschach, Symonds Picture-Story Test, etc.
In clinical work with mental defectives, the writer has observed on a number of oc- 
casions that, whereas comparatively little data could be secured through the ad- 
ministering of the standard projective techniques, much valuable information could 
be uncovered through having the subject describe his dreams. It is quite possible 
that dream reporting for some defectives represents less of a psychological threat 
than does the responding to certain projective tests. 


SUMMARY AND CONCLUSIONS 


An attempt was made to investigate the nature of the dreams of a group of 
institutionalized mental defectives. The subjects consisted of 50 males and 50 
females between the ages of 14 and 18. The range of IQ’s for the overall group was
55 to 72. In a personal interview each subject was asked to simply tell what he 
dreamed about most often. Following this, the participant was asked to respond to a 
questionnaire. In view of the reported results, certain tentative conclusions seem 
warranted. 

1. Most mental defectives experience nocturnal dreams.

2. Inasmuch as more of the subjects reported experiencing dreams about “Home”
and “Going Home” than any other motif, the dreams of mental defectives tend
to support Freud’s dream theory of attempted wish fulfillment. (Institutionalized
defectives very frequently express a desire to go home.)

3. The element of fear plays a role in the dreams of many mental defectives.

4. The dreams of defectives are diversified.

5. The dreams of defectives are predominantly pleasant.

6. Defectives tend to forget most of their dreams.

7. A number of significant sex differences exist in the dreams of mental defectives.



COMPARATIVE STUDY OF A SERIES OF TAT AND CAT CARDS¹


BERNARD H. LIGHT 
West Virginia University 


INTRODUCTION 


In 1949 Bellak introduced the Children’s Apperception Test, which consists
of a series of ten animal pictures based on the rationale that children between the
ages of three and ten will more readily identify themselves with animal figures than
with human figures. There is some justification for this assumption in Freud’s
Analysis of a Phobia in a Five Year Old Boy, and in totemism, myths, fables, children’s
dreams, and Rorschach responses.

In spite of this theoretical foundation, there has been little empirical research 
to confirm Bellak’s assumption. Bills utilized ten TAT cards and ten cards containing
rabbit figures, concluding that “it appeared easier for the children to formulate
stories for the animal cards than for the TAT cards”. Biersdorf and Marcuse
substituted human figures on the CAT for the animal figures and utilized
the same scenes, situations, and shading effects so that an equivalent set of human
figures was obtained for six cards. Biersdorf and Marcuse were not, however, able to find
the differences Bills reported, and they concluded “that there was no quantitative
difference between data collected from the stories of pictures containing animal
figures and those containing human pictures”.

It appears that previous studies have not been able to determine, with any de- 
gree of conviction, whether there is greater productivity in response to pictures con- 
taining animal figures than there is in response to pictures containing human figures. 
Neither Bills nor Biersdorf and Marcuse have been concerned with the dynamic 
aspects of personality which would be revealed in such elements as story themes, 
conflicts, and feelings. Quantitative differences based on such criteria as length of 
time before response, length of response time, number of promptings and rejections, 
do not seem to be valid indicators of identification. This study is, therefore, an at- 
tempt to utilize the dynamic aspects of the stories in order to determine if pre- 
adolescents will more readily identify themselves with animal figures than with 
human figures. 


METHOD 


Seventy-five fourth and fifth grade children, 38 boys, 37 girls, with ages ranging 
from 9 years to 10 years, 6 months, the mean age being 9 years, 8 months, served as 
subjects. 

The group administration method was used, and five TAT and five CAT cards 
were alternately projected on a large screen, via an opaque projector, in the following
order: 2CAT, 1TAT, 3CAT, 2TAT, 7CAT, 3BM-TAT, 8CAT, 7GF-TAT,
10CAT, 12M-TAT. The subjects were instructed, as a group, to write a dramatic 
story for each of the cards, with special reference being given to present happenings, 
past happenings, outcomes, and feelings. 

Each picture was exposed for a period of seven minutes (the subjects having 
been informed of the length of exposure time beforehand), and the subjects were en- 
couraged to complete each story in that interval. A total of 750 stories were ob- 
tained in this manner. 

The stories were evaluated and analyzed both quantitatively and qualitatively 
with reference to the following six criteria: (1) Number of words, (2) feelings, (3) 
conflicts, (4) outcomes, (5) themes, and (6) figures introduced. Two psychologists 
made comparative evaluations of 100 stories for purposes of reliability, and a value 
of .88 was obtained. 


¹This paper was presented in part at the meeting of the Midwest Psychological Association,
Chicago, Illinois, May 1953.


RESULTS


The results, as presented in Table 1, show clearly that there are significant differences
between the animal and human stories in five of the six criteria.


TABLE 1. DIFFERENCES IN RESPONSES OF 75 FOURTH AND FIFTH GRADE
CHILDREN TO ANIMAL AND HUMAN STORIES

Criteria                   Chi-square      Significance

Number of words                            Not significant*
Feelings
   Number                                  1%
   Kind                                    1%
Conflicts
   Number                                  1%
Outcomes
   Number                                  10%
   Kind                                    1%
Themes
   Number                  5.              2%
   Kind                                    1%
Figures introduced
   Number                  6               1%

*Derived by the use of t


The direction of these differences is best expressed in terms of percentages which 
are as follows: 


Feeling. Sixty-one per cent of the animal stories lacked feelings while only 
35% of the human stories lacked feelings. In those stories containing feelings, 31 
different kinds were revealed in response to the animal cards while 51 different kinds 
were revealed in response to human cards.


Conflicts. Nineteen per cent of the animal stories contained anthropomor- 
phized types of conflict while 33% of the human stories contained conflict. 


Outcomes. Thirty-two per cent of the animal stories and 26% of the human 
stories lacked outcomes. Of the animal stories containing outcomes, 32% were 
classified as being ‘“‘different” in content and 36% the “same” in content. Of the 
human stories containing outcomes, 60% were classified as being ‘‘different’’ in 
content and 14% the “‘same”’ in content. 


Themes. Sixty-five per cent of the animal stories were regarded as having the 
same themes and 35% different themes. Thirty-eight per cent of the human stories 
were regarded as having the same themes and 62% different themes. 


Figures Introduced. Eighty-three per cent of the human stories contained 
mothers, fathers, sibs, peers, or other human figures, but only 44% of the animal 
pictures contained such identifications. 


DISCUSSION 


The results of this study do not support Bellak’s assumption, and they are in 
disagreement with the findings of both Bills and Biersdorf and Marcuse. The superiority
of the human cards was demonstrated, except with reference to story length
and number of outcomes, criteria which may have been affected by the time allotted 
for writing the stories. Thus there is an indication, from the age group utilized, that, 
in terms of the criteria employed, there is better identification with human figures 
than with animal figures.


SUMMARY AND CONCLUSIONS


This study was an attempt to test Bellak’s assumption that pre-adolescent 
children would more readily identify with animal figures than with human figures. 
To this end, five CAT and five TAT cards were alternately projected on a screen 
and a group of 75 fourth and fifth grade children with a mean age of 9 years 8 months 
were asked to write stories in response to the pictures. The stories were compared 
and analyzed quantitatively and qualitatively with reference to the six criteria of 
length, feelings, conflicts, outcomes, themes, and figures introduced. Contrary to 
Bellak’s assumption, this study suggested that for this particular age group, there 
was better identification with human figures than with animal figures.



A FACTORIAL STUDY OF BODY TYPES

MAURICE LORR AND VICTOR FIELDS


Veterans Administration¹ and Bureau of Naval Personnel,* Washington, D.C.


PROBLEM 


The study of human physique in the hope of throwing light on personality, in- 
telligence, growth, and disease, has been pursued since the time of Hippocrates. 
Numerous factorial studies directed at the isolation of basic morphological parameters
have been reported. Among the more recently reported analyses are those by
Thurstone, Moore and Hsu, and Heath.

The grouping or classification of human physique in accordance with patterns
of morphological components has been most recently explored by Sheldon and his
co-workers. The classifications or typologies prepared by Rostan, Viola, Naccarati
and Kretschmer are well known. While these classifications have much in common,
agreement as to the number and nature of the groups is not particularly conspicuous.
Sheldon, for example, presents some 76 somatotypes; others propose, on
the basis of somewhat different criteria, three or four body types. It was the purpose 
of this study to test, by means of an obverse factor analysis, the hypothesis that 
there are three differentiable independent groups into which the adult human male 
can be classified. 

Sheldon’s classification of human physique is based on a three-component sys- 
tem. The first component, endomorphy, is indicated by the relative predominance 
of soft roundness throughout the various regions of the body. Mesomorphy implies 
relative predominance of muscle, bone, and connective tissue. The third component, 
ectomorphy, is described as the relative predominance of linearity and fragility. 
Sheldon assigns a numeral 1 to the least degree of a component and a 7 to the maxi- 
mum manifestation of each component. Each pattern or somatotype is described by 
three numerals, one for each position relative to the scale for each component. The 
first digit is used to indicate endomorphy, the second mesomorphy, and the third 
ectomorphy. The three body types or patterns hypothesized for examination were 
those corresponding most closely to Sheldon’s types 711, 171 and 117. In brief, it 
was assumed that there are three differentiable somatotypes corresponding to Sheldon’s
patterns of extreme endomorph, mesomorph, and ectomorph. However, the
variables defining these patterns were by no means restricted to Sheldon’s three 
components. 
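
The three-digit notation described above can be made concrete with a short sketch. The names `Somatotype`, `parse_somatotype`, and `dominant_component` are hypothetical helpers introduced only for illustration; they are not part of Sheldon's system or of this study.

```python
from typing import NamedTuple


class Somatotype(NamedTuple):
    endomorphy: int   # first digit
    mesomorphy: int   # second digit
    ectomorphy: int   # third digit


def parse_somatotype(code: str) -> Somatotype:
    """Read a three-digit code such as '711', each digit rated from 1 to 7."""
    if len(code) != 3 or not (code.isascii() and code.isdigit()):
        raise ValueError("a somatotype code must be exactly three digits")
    ratings = [int(ch) for ch in code]
    if any(r < 1 or r > 7 for r in ratings):
        raise ValueError("each component is rated from 1 to 7")
    return Somatotype(*ratings)


def dominant_component(somatotype: Somatotype) -> str:
    """Name of the component with the highest rating (first one wins a tie)."""
    return max(somatotype._fields, key=lambda name: getattr(somatotype, name))


for code in ("711", "171", "117"):
    print(code, dominant_component(parse_somatotype(code)))
```

Read this way, the three hypothesized patterns 711, 171, and 117 are the extreme endomorph, mesomorph, and ectomorph respectively.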


PROCEDURES 


The population consisted of photographs of hospitalized male psychotic patients. 
Each photograph showed the nude subject standing erect on a pedestal in three 
aspects: anterior, lateral, and posterior. Sound design in a factor analysis, whether 
direct or obverse, requires that (a) each hypothesized factor be adequately repre- 
sented by a minimum of three or four variables, and (b) factorially simple variables 
be chosen to define each factor. Towards the first requirement five individuals were 
selected out of a sample of 90 white males to represent each body type hypothesized. 
To satisfy the second requirement only the “purest’’ or most extreme members of 
each proposed body type were selected from the 90 photographs available. 

Using the three aspects of the subject’s photograph, linear morphological measurements
were made by means of a pair of dividers with a delicate screw adjustment
and a scale graduated to tenths of a millimeter. The 16 direct measurements were as
follows: biacromial diameter, neck diameter, chest diameter (anterior), chest diameter
(lateral), bicristal diameter, umbilicus to buttocks, trunk length, sternal
length, symphysis height, arm length, lower arm diameter, thigh diameter (lateral),
thigh diameter (anterior), lower leg length, lower leg diameter, and foot length.
Height and weight were taken from the information recorded on the photographs.
In addition, ratings of fatness and muscularity were made independently by the
authors on a six-point scale and averaged. Agreement between the ratings was very
close. The ratings made were on muscularity in the region of the shoulders, trunk,
leg, and arm; fatness in the region of the shoulders, trunk, leg, and arm; shoulder
ectomorphy, and trunk volume. The remaining variables represent ratios based upon
variables 1 to 27, inclusive. The 36 measures were used in determining the correlations
between the 15 subjects. The standard error of a zero correlation with an N of
36 is .169.


¹From the Veterans Administration Central Office, Washington, D. C. The authors gratefully
acknowledge Dr. Richard L. Jenkins’ helpful comments, and the assistance of Frank A. Gross in the
supervision of many of the computations.

*The opinions and assertions in this paper are those of the writers and are not to be construed as
official or reflecting the views of the Navy Department or the Naval Service at large.


TABLE 1. THE INTERCORRELATIONS BETWEEN THE 15 EXPERIMENTAL SUBJECTS
(DECIMALS ARE OMITTED)

2 3 4 5 6 7 8 9 10 11 12 13 14 15
727 793 336 182 -210 -013 -266 -485 -589 -220 -069 064 -388 -542
751 360 513 -291 015 -203 24 -514 -339 -265 -098 -328 471
650 286 -416 -142 -361 -696 -191 031 069 -212
246 -732 -345 -586 -595 076 274 309
-186 -128 -166 -177 -428 -165 -185
636 848 55 724 -528 -656 -723
698 5 547 -626 -691 -632
2 762 -576 -757 -769
826 -461 -640 -713
-394 -556 -621
785





The 36 measurements and ratios were converted to standard scores to establish 
a common metric. The intercorrelations computed between the 15 subjects were 
Pearson coefficients. The correlational matrix presented in Table 1 was factored by 
the centroid method. The third factor extracted was obviously residual since it was 
found to be linear with the second. The mean third factor residual was -.025. The
rotations to achieve simple structure were oblique. 
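
The first steps of the obverse analysis described above (standard scores for each measure, then Pearson correlations between persons rather than between measures) can be sketched as follows. The data are hypothetical, with 3 persons and 4 measures standing in for the study's 15 persons and 36 measures, and the function names are illustrative only. The last function assumes the classical formula 1/sqrt(N - 1) for the standard error of a zero correlation, which reproduces the .169 reported for N = 36.

```python
import math
from statistics import mean, pstdev


def standardize_columns(rows):
    """Convert each measure (column) to standard scores across persons (rows)."""
    z_columns = []
    for column in zip(*rows):
        m, s = mean(column), pstdev(column)
        if s == 0:
            raise ValueError("a measure with no variation cannot be standardized")
        z_columns.append([(x - m) / s for x in column])
    # Transpose back so that each row is again one person's profile.
    return [list(person) for person in zip(*z_columns)]


def pearson(x, y):
    """Pearson correlation between two equally long profiles."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def person_correlations(rows):
    """Correlate every pair of persons over their standardized measures."""
    z = standardize_columns(rows)
    return [[pearson(p, q) for q in z] for p in z]


def se_zero_correlation(n):
    """Standard error of a zero correlation based on n paired observations."""
    return 1 / math.sqrt(n - 1)


# Hypothetical measurements: one row per person, one column per measure.
measurements = [
    [170.0, 95.0, 40.0, 2.0],
    [182.0, 60.0, 31.0, 1.0],
    [176.0, 80.0, 38.0, 5.0],
]
r = person_correlations(measurements)
print(round(se_zero_correlation(36), 3))  # 0.169
```

In an obverse analysis the number of observations behind each correlation is the number of measures, which is why the standard error is computed with N = 36 rather than with the 15 subjects.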


RESULTS 


The centroid and oblique factor loadings of the two person factors extracted 
are shown in Table 2. Person factor A is bipolar and is defined at either end by the 
endomorphs and ectomorphs. At one end of the factor is the individual who is fat, 
relatively heavy, has a low height to weight ratio, high trunk volume, wide hips, a 
low shoulder width to hip ratio, and a low trunk length to leg length ratio. The 
ectomorphic individual, at the other end of this continuum, is thin, weighs relatively 
little, has small trunk volume, narrow hips, a high height to weight ratio, and a high 
trunk length to leg length ratio. Muscularity is relatively absent in both groups. 

Factor B is defined by the mesomorphs as a group. They are highly muscular in 
all regions of the body, their shoulders are relatively wide, and the shoulder width to 
hip ratio is high. They tend to be between the other two groups in weight, chest 
width, trunk volume, height to weight ratio, and trunk length to leg length ratio. 

It is of some interest to consider Sheldon’s three components in the light of these 
findings. Sheldon reports the following correlations between his three morphological 
components for 200 normal males: first-second, -.29; first-third, -.41; second-third,
-.63. For another series of 800 males his correlations were: first-second, -.28;
first-third, -.46; second-third, -.59.


TABLE 2. ORTHOGONAL FACTOR MATRIX F, OBLIQUE FACTOR MATRIX V, AND TRANSFORMATION
MATRIX A. DECIMAL POINTS ARE OMITTED


The intercorrelations can be represented geometrically in a very useful way.
Let each morphological component be represented by a vector of unit length in 3- 
dimensional space. Then each correlation can be represented as the cosine of the 
angular separation between any two vectors. If the angle is 90 degrees the cosine is 
zero and if the angle of separation is 0 degrees, the correlation is unity.* 

Now if the three components were actually independent their intercorrelations 
would be zero and each vector would be separated by an angle of 90 degrees from the 
other two in 3-dimensional space. The angular separation of the three vectors would 
sum to 270 degrees. On the other hand, if the three components could be represented 
in only 2-dimensional space on a plane surface the angles separating the three vectors
would of necessity sum to 360 degrees.

When Sheldon’s intercorrelations are translated into angles between unit vect- 
ors they sum to approximately 350 degrees in each of the two groups. Since the sum 
of the angles between any three vectors on a plane surface is 360 degrees, and the 
sum of angles here is 350 degrees, it seems more plausible to interpret Sheldon’s com- 
ponents in terms of 2-dimensions than 3. 
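
The translation of correlations into angles can be checked with a few lines of arithmetic. The sketch below takes the arc cosine of each of Sheldon's reported intercorrelations and sums the three angles for each series; the function and variable names are illustrative only.

```python
import math


def angle_sum_degrees(correlations):
    """Sum of the angles (in degrees) whose cosines are the given correlations."""
    return sum(math.degrees(math.acos(r)) for r in correlations)


# Sheldon's intercorrelations as quoted in the text.
series_200 = [-0.29, -0.41, -0.63]  # 200 normal males
series_800 = [-0.28, -0.46, -0.59]  # 800 males

sum_200 = angle_sum_degrees(series_200)
sum_800 = angle_sum_degrees(series_800)

# Three independent components (all correlations zero) for comparison.
sum_independent = angle_sum_degrees([0.0, 0.0, 0.0])

print(round(sum_200, 1), round(sum_800, 1), round(sum_independent, 1))
```

Both series give sums within a degree of 350, against 270 for three independent components and 360 for three vectors lying in one plane.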

An examination of Sheldon’s components also suggests that they represent patterns
of three groups similar to those identified in this study, rather than trait factors.
Like patterns, the components are defined in terms of ratios of more elementary
measurements. They represent patterns in which muscle or fat, leg length or torso length,
combine.


*Note that cosine (180° - θ) = - cosine θ.

SUMMARY

This study has demonstrated the existence of three distinguishable groups and
two person body factors. The three groups exhibit morphological trait patterns that
closely resemble the patterns descriptive of Sheldon’s components. The findings also
suggest that the 76 somatotypes identified by Sheldon can more simply and economically
be defined in terms of measurements on only two type factors.

AN ITEM ANALYSIS OF CHILDREN’S DRAWINGS OF A HOUSE*
SYLVIA MARKHAM 


Dept. of Psychiatry, Mt. Sinai Hospital, New York City 


INTRODUCTION 

Few statistical studies of the HTP with respect to intelligence have been made 
with children, nor has the drawing of a house been used as the sole instrument for 
investigation. Buck® asserts that it is possible to gain useful information con- 
cerning the subject’s intellectual level from his drawings, but its usefulness as a 
measure of intelligence is open to question. If it could be shown that a child’s draw- 
ing of a house bore some relation to mental age and/or chronological age, a useful 
tool might be developed which could supplement other intelligence tests. If such a 
relationship existed, it might be possible to use the test as a screening technique in 
which broad differences in intellectual functioning might be determined. 

The aims of this study were (1) to evaluate quantitatively the frequencies of 
the items that appear in children’s drawings of a house, (2) to determine to what 
extent developmental and intelligence factors affect changes in children’s drawing 
of a house, and (3) to examine the validity of the hypothesis that a child’s drawing 
of a house gives useful information concerning the child’s intellectual and develop- 
mental levels. A tabulation of items found in children’s drawings of a house dis- 
closed the necessity of making some revision in Buck’s quantitative scoring (which 
is intended primarily for adults). Buck’s scoring point scheme was used as the basis 
for this study, but certain of his items were dropped, other items were added, and 
some items were combined. 


METHOD 
The subjects for this study were 136 children in the kindergarten through the 
fourth grades in Public School # 1 in Lawrence, Long Island, New York, representing 
a middle class socio-economic level. The ages of the children ranged from five years 
no months to nine years eleven months. For the final evaluation, 100 children were
used; 20 at each age level, from five years through nine years, selected at random.
The test was administered individually, with no one in the room except the child and
the examiner. IQ’s were obtainable for all the children except the five year group.
In considering the effect of intelligence on the drawing of a house, the five year old
children were omitted. In the final summary, of the 80 subjects used, the IQ’s were
obtained in the following way: 3 from the Stanford Binet, 18 from the Otis Quick
Scoring Mental Ability Test-Alpha Form, and 59 from the Pintner-Cunningham
Primary Mental Test. Of the 80 children, 12 were dull (IQ between 70 and 89), 40
were average (IQ between 90 and 109), and 28 were superior (IQ of 110 and
over). The children’s art instruction did not include any formal training in the drawing
of a house so that no stereotype had developed and the factor of learning was
minimized.

*The author wishes to express thanks to Dr. Fred Brown, Chief Psychologist, Mt. Sinai Hospital,
under whose mentorship this paper was prepared, and to Donald MacDonald, Principal, and members
of the teaching staff of Public School No. 1, Lawrence, L. I. for their cooperation during this study.

RESULTS 


Since gross differences were revealed in the drawings of the five year old child as 
compared to the six year old, the five year old group was treated separately, and 
their drawings were excluded from the summaries of frequencies. The factors that 
distinguished the drawings of the five year old group were (1) their tendency to 
draw flat roofs, (2) difficulties with perspective, (3) misplaced roofs, (4) difficulties 
in proportion, and (5) poor motor coordination. 

Good and flaw items were compiled in their order of frequency with the following 
showing a frequency of 50% or more: (1) two-dimensional roof, (2) chimney, (3) two
walls, (4) door, (5) door-knob, (6) three or more windows, (7) windows in correct
alignment, and (8) window panes. The two most common flaw items were paper
basing (frequency of 48%) and malproportioned windows (frequency of 43%). All
other flaw items appeared with relative infrequency (18% or less). 

Changes in the drawing of a roof from the five year old level to the ten year old 
level appear to be influenced by both age and intelligence. At five years the child 
seems only capable of drawing the flat roof, although a few children (28%) attempted 
to transform the straight line into an oblique V with one end pointing upward. The 
two dimensional roof (the combination of an inverted V and a rectangular roof) is 
closely related to both age and intelligence. The chimney shows extreme variability 
with a frequency of 50% at five years, 60% at six, 90% at seven, 75% at eight, and
50% at nine. Wall, chimney and roof material appears related to intelligence but
emotional factors may account for its inclusion in the drawing. Paper-basing was
such a frequent error that it is questionable whether it should be construed as a flaw
in children’s drawings. The door was drawn by almost every child (93%) while the
door knob and the door panel were drawn more frequently by the brighter children 
at all ages. The failure to draw windows in the main wall of the house was a flaw 
which showed up with greater frequency in the drawings of the dull and the younger 
children. The older and brighter children were more likely to include window de- 
tails in their drawings. The capacity to draw windows in correct alignment was a 
function of maturational level and not intellectual level. Well-spaced windows (not 
too many or too few) showed a significant relationship to intelligence but not to age. 
Details such as porch, steps, walkway, shrubbery, garage, and ground-line made no 
appearance at all in the drawings of dull children. Good motor coordination was 
most frequent in the drawings of the superior children in the eight and nine year old 
group. Good placement on the page showed a significant relationship to the child’s
maturational level. Very few transparencies appeared in the drawings, a total of 
three out of one-hundred. Drawing the house slightly or grossly off the vertical 
axis was a frequent flaw: 25% of the six and seven year old group and 27% of the 
older children made this error. 

Factors that are Statistically Significant in Revealing the Intellectual Functioning 
of the Child. The differences in the frequency of items in the drawings of dull, average 
and superior children form a continuum. The differences in intellectual functioning 
of the three categories are gradual, the upper end of one group merging imperceptibly
into the lower end of the next group. The average group was therefore omitted
from this study. Between the dull subjects (N equals 12) and the superior ones (N
equals 28), certain items revealed statistically significant differences. A two-dimensional
roof, a house with two walls, and windows that were well-spaced were the three
items that showed significant differences between the dull and the superior child,
with a probability of less than .05. A door-knob, a window in the attic and good
motor coordination showed significant differences with a probability between .05
and .10, and therefore might be used to indicate a trend of lesser significance. All
the other differences were beyond the .10 level of probability. 


Factors that are Statistically Significant in Revealing the Developmental Level 
of the Child. In comparing the drawings of the younger group (six and seven year old 
children, N equals 40) with the older group (eight and nine year old children, N 
equals 40), certain differences stand out as statistically significant. The older group 
showed significant superiority in the following good items: a two-walled house, correct
alignment, good placement, and a window in the attic (P equals .01). The differences
were also significant in the frequencies of the two-dimensional roof and window
shades or curtains (P equals .05). Of lesser significance, with a probability between
.05 and .10, were the drawing of a door panel and three or more windows. Unexpectedly,
the younger group showed superiority in drawing a door on a vertical axis. A
slanting door appeared more frequently in the drawings of the older group, showing 
a statistical difference with a probability of less than .05. Emotional stresses of the 
eight and nine year old child may have been responsible for this flaw. 


SUMMARY AND CONCLUSIONS 
1. The drawings of a house of five year old children differ fundamentally from draw- 
ings of older children, and should be analyzed and scored on a separate scale. 
2. Eight items that show a frequency greater than 50% may be assumed to be
basic in children’s drawings of a house. Omission of one of these items would con- 
stitute a serious flaw in the drawing. Good items showing less frequency at certain 
age levels might not be considered flaws for that age. 


3. Many items show a relationship to levels of intelligence but only three of these 
show a statistically significant difference at the .05 level of confidence between the 
dull and the superior child. 


4. Many items show a relationship to the developmental level of the child, but only 
six of these show a statistically significant difference at the .05 level of confidence. 
One flaw item showed a significant increase with increasing age. 

5. Emotional factors may explain some of the distortions and omissions in the 
drawings. 


A house drawing appears to be an inadequate technique for measuring the in- 
tellectual level of a child since only three items (a two-dimensional roof, a two-walled 
house and well spaced windows) show significant positive correlation with mental 
age. These items might be used as differentiating factors in making a gross distinc- 
tion between dull and superior children. Some promise lies in a study of the qualita- 
tive features of the maturational process of the drawing with particular reference to 
behavioral implications. The striking changes that appear in the drawing of the six 
year old child as compared to the five year old child suggest that an important matur- 
ational advance takes place at this time which represents a different level of psycho- 
motor functioning. The house drawings might be utilized as part of a battery of 
tests at the sixth year level to determine readiness for second grade work. Lastly, in
evaluating the house drawing of an adult, in addition to obtaining useful information
about personality structure and level of adjustment, it would appear legitimate
to utilize the drawing as a means of assessing the degree of arrest or regression in
emotionally disturbed individuals.

RORSCHACH CONTENT AS A FUNCTION OF PERCEPTUAL
EXPERIENCE AND SEX OF THE EXAMINER* 


ALBERT RABIN, WILLIAM NELSON AND MARGARET CLARK 
Michigan State College 


INTRODUCTION 


The stability of the Rorschach profile or pattern and of the factors constituting 
it has been of great concern to workers with the technique as well as to critics of it. 
To what extent the record reflects stable personality factors, and to what extent it is 
subject to passing and temporary effects and experiences, is of interest to both in- 
vestigators and practitioners. 

In order to attempt a partial answer to these broad questions, limited objectives 
have been set in this study. These objectives were to determine the influence of some 
incidental perceptual experiences, immediately preceding the Rorschach examina- 
tion, upon the two important content categories of anatomy and sex. The selection 
of these content categories was governed by the consideration of their importance in 
the conventional interpretation of Rorschach records, especially those of psycho- 
pathological subjects. Recent reports on large normal adult samples®? indicate the 
comparative scarcity of Anatomy and Sex responses. 

Implicit in Rorschach theory is the concept that the product obtained with the 
test, that is, the protocol, is insignificantly affected by perceptual experiences im- 
mediately preceding the examination. It is considered to project the more stable and 
consistent “private world” of the individual. This leads us to formulate the following
null hypothesis: 


Visual stimuli representing anatomy and sex, to which subjects are exposed immediately
prior to the Rorschach examination, have no effect upon the incidence of records in which
anatomy and sex responses are produced.


PROCEDURE AND SUBJECTS 


The Rorschach test was individually administered to 66 male undergraduate 
students who volunteered for the experiment. The subjects were divided into three 
groups. Group A, the anatomy group, contained 16 students; group S, the sex group,
was made up of 30 students; and group C, the control group, included the remaining
20 subjects. No known factors other than chance governed the placement of the
subjects in the several groups. There were no significant differences between the 
mean ages of the groups. 

Each subject of group A was placed in a “waiting room”, decorated with an 
array of anatomical charts, pictures of operations and with photographs from medical 
literature. The waiting period lasted seven minutes. Following this seven minute 
waiting period each subject was conducted to an ordinary examining room for the 
Rorschach test administration. 

Each subject of group S was placed in a waiting room decorated with photographs
of nude and semi-nude females and other “sexy” pictures. Otherwise, the
same procedure as with group A was followed. 

Group C waited in an “undecorated” waiting room prior to the Rorschach examination.
Following the seven minute wait they too were conducted to another
room for the testing. 

Half of the subjects in each group were examined by a male examiner and the 
other half, by a female examiner. This division was performed in random fashion. 


RESULTS 


A report of the number of subjects, in each group, who gave sex and anatomy 
responses is given in table 1. The incidence of those who produced anatomy responses
in the three groups shows no striking differences. In this respect the A group
assumes a middle position between S and C, especially when the percentages of incidence
are considered. Consequently, it appears that the anatomy stimuli effected
no increase in the incidence of individuals producing that type of response. Thus,
with respect to anatomy responses, the null hypothesis is borne out.

*Presented at the 1952 meetings of the Midwestern Psychological Association, Chicago, Illinois.


TABLE 1. INCIDENCE OF ANATOMY AND SEX RESPONSES

Group            Anatomy, %
Anatomy (A)          88
Sex (S)              73
Control (C)          95


The incidence of individuals producing sex responses in the S group is at least three
times as great as that found in the other two groups. The computed chi square of the
differences between the S and the A and C groups shows a significance close to the 5 percent level.
The null hypothesis may be rejected, but not overwhelmingly so. It may also be 
speculated that were it not for the differences in the sex of the examiners, to be dis- 
cussed below, greater incidence of sex responses in the S group would have been 
obtained. It may be added, parenthetically, that the norms reported °) with regard 
to sex and anatomy responses do not hold up with a college student population. 

The effect of the sex of the examiner on the content categories studied was ob- 
tained as a by-product of the investigation. From table 2, it is quite obvious that the 


TABLE 2. INCIDENCE OF ANATOMY AND SEX RESPONSES IN RECORDS OF
BOTH EXAMINERS

                   Anatomy
Group             M      F
Anatomy (A)       7      7
Sex (S)          11     11
Control (C)       9     10


sex of the examiner had no effect on the elicitation of anatomy responses. However, 
it is equally obvious that the female examiner apparently inhibited the male sub- 
jects and prevented them from producing sex responses. Only one subject of the 33 
tested by the female examiner produced sex responses; he was a member of the S 
group. The male examiner, however, obtained sex responses in the records of in- 
dividuals in all three groups. Eleven of his 33 subjects produced sex responses com- 
pared with the single subject tested by the female examiner. The difference in in- 
cidence is highly significant statistically (P = .01). 
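
The examiner comparison can be laid out as a fourfold table from the counts given in the text: 11 of 33 subjects with sex responses under the male examiner, 1 of 33 under the female examiner. The sketch below computes a chi square for that table, with and without Yates' correction. The text does not say which test the authors used, so this is an assumed reconstruction for illustration; the function name is hypothetical.

```python
import math


def chi_square_2x2(a, b, c, d, yates=False):
    """Chi square (1 df) and upper-tail probability for the table [[a, b], [c, d]]."""
    n = a + b + c + d
    diff = abs(a * d - b * c)
    if yates:
        # Yates' continuity correction, floored at zero.
        diff = max(diff - n / 2.0, 0.0)
    chi2 = n * diff ** 2 / ((a + b) * (c + d) * (a + c) * (b + d))
    # For one degree of freedom, P(chi square > x) = erfc(sqrt(x / 2)).
    p = math.erfc(math.sqrt(chi2 / 2.0))
    return chi2, p


# Rows: male examiner, female examiner.
# Columns: subjects with sex responses, subjects without.
chi2, p = chi_square_2x2(11, 22, 1, 32)
chi2_corrected, p_corrected = chi_square_2x2(11, 22, 1, 32, yates=True)

print(round(chi2, 2), round(chi2_corrected, 2))
```

Under either variant the probability falls below .01, which is in line with the level reported above.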


DISCUSSION


The literature dealing with directly and consciously induced “sets” and their
influence upon Rorschach performance is irrelevant to the present paper and of
questionable relevance to Rorschach hypotheses in general. 

The work of Norman et al“) is of greater relevance to the present study. These 
workers induced “sets” for food and movement by presenting their subjects with 
magazines dealing with these topics. They found no alteration in the responses of 
their subjects as a result of this experience preceding examination. These findings
agree with our findings with respect to anatomy responses. However, in our experiment,
the sex stimuli were apparently much more powerful so that greater productivity
in this content category was evoked.

Two studies, with somewhat contradictory results, deal with the effects of sex 
of the examiner upon the production of sex responses. Alden and Benton“? found no 
significant differences in the incidence of “frank or hidden” sexual content, in records
of psychiatric patients, between a male and female examiner. Since our study deals 
with clear and “frank” sex responses, our results are not directly comparable. It
may also be that the psychiatric patients do not show the same level of inhibition as 
do young college students in the presence of a female examiner. On the other hand, 
the results reported by Curtis and Wolf are in greater agreement with the present 
findings, though they dealt with “covert” as well as overt sex responses. 

SUMMARY AND CONCLUSIONS 

Perceptual experiences immediately preceding the Rorschach examination tend
to affect the incidence of individuals giving sex responses. A greater proportion
of subjects produced sex responses after being exposed to sex pictures than those
who were not similarly stimulated.

Subjects exposed to anatomy pictures, just prior to the examination, who gave
anatomy responses are not proportionately more numerous than those who
were not exposed to the same pictures.

The sex of the examiner is an important factor in affecting the elicitation of sex
responses. The female examiner tends to inhibit the production of sex responses
in the male subjects. No similar effects are noted with respect to anatomy
responses.

A COMPARATIVE STUDY OF RORSCHACH SCORING METHODS IN
EVALUATING PERSONALITY CHANGES RESULTING FROM
PSYCHOTHERAPY
ARNOLD O. D. PETERSON¹


The Pennsylvania State University 


PROBLEM 


The purpose of this study was to compare lists of Rorschach signs used in re- 
search to evaluate personality changes in individuals undergoing client-centered 
therapy. 

Conflicting claims have been reported in the literature regarding the efficacy of 
Rorschach signs of adjustment depicting personality changes in individuals under- 
going psychotherapy. Muench? in 1947 first reported Rorschach evidence of 
personality change in all of twelve cases receiving non-directive psychotherapy. 
Hamlin and Albee“ provided Muench with a control group study. In 1949 Carr“ 
published a study which, in using twenty signs of adjustment considered essentially 
the same as Muench’s signs, did not support Muench’s results. 

¹This study was conducted in partial fulfillment of the requirements for the M. S. degree, The
Pennsylvania State University, 1953.

Haimowitz® in 1948 conducted an elaborate Rorschach study which indicated
significant improvement in her cases with the use of Harrower-Erickson’s Neurotic 
Sign Index. A study by Mosak“?, using Carr’s signs of adjustment, found the Ror- 
schach of little value as a pre- and post-therapy test. In 1951 Tucker“ found that 
the Rorschach did not give significant correlations with a multiple criterion measure 
of success in therapy. 

From one study of patients undergoing psychoanalytically oriented psycho- 
therapy, Piotrowski and Schreiber“ emphasized that the qualitative change in 
Rorschach responses was more important in reflecting personality changes than the 
quantitative changes. 

In each study except the study of Piotrowski and Schreiber the lists of Ror- 
schach signs of adjustment varied in number from twelve to 22 signs. A normalcy 
gain score or critical score of adjustment was established beforehand for each sign 
in the lists. Then, from the patient’s pre- and post-therapy Rorschachs, the actual 
scores for each sign were determined. Indication of personality change was inferred 
if there was a significant gain (Fisher’s t-score) in normalcy gain scores of the post- 
therapy Rorschach over the pre-therapy Rorschach. Also, the difference between 
the number of improvement (increment) and the number of decrement scores for the 
sign lists was determined, since it might be possible for the post-therapy Rorschach 
to show changes without meeting the normalcy gain criteria. 

Differences in the specification of the Rorschach signs of adjustment were 
noted. In comparing the signs of Muench and Carr, Muench’s signs seemed to be 
specified more qualitatively or rigorously than Carr’s signs. For example, both the 
M and W of Muench’s list were based only on better than average responses, while 
Carr’s M and W% were purely quantitative and considered all M’s and W’s. It was 
judged that Haimowitz’s signs showed greater qualitative specification than Carr’s 
signs and less than Muench’s signs. 


PROCEDURES 


The writer applied the Rorschach evaluation methods (sign lists) of Muench,
Haimowitz, and Carr to pre- and post-therapy Rorschach records of 42 subjects of
the Psychotherapy Research Group of The Pennsylvania State University².

An independent, objective criterion of success in therapy was a composite 
criterion score derived by Tucker“ for each subject. Tucker’s individual criterion 
scores were correlated with the Rorschach change scores of Muench, Haimowitz, 
and Carr in two ways: (a) as improvement-decrement changes, and (b) as changes in 
normalcy gain scores. 

The hypothesis, stated in specific terms, was that the correlations of both the
improvement-decrement scores and the normalcy gain scores with Tucker’s criterion
scores would rank, from highest to lowest, in the order of Muench, Haimowitz, and
Carr, an order which reflected the judged degree of qualitative specification.


RESULTS 


Improvement-decrement scores gave correlations with Tucker’s criteria in the 
predicted order, viz. Muench (r of .127), Haimowitz (r of .048), and Carr (r of .006). 
But all correlations were so low that judgment of the relationship between methods 
must be guarded. 

Normalcy gain scores gave correlations with Tucker’s criteria which were con- 
trary to the hypothesis, viz. Carr (r of .260), Muench (r of .123), and Haimowitz 
(r of .035). However, again, all correlations were so low that judgment of the re- 
lationship between methods must be guarded. 
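
How low these correlations are can be gauged by converting each r to a t value. The sketch below assumes that all 42 subjects entered every correlation (the text gives 42 as the number of records but does not state N for each coefficient) and uses the tabled two-tailed .05 critical value of t for 40 degrees of freedom, 2.021. The names are illustrative only.

```python
import math


def t_for_r(r, n):
    """t statistic for testing a Pearson r against zero, with n - 2 df."""
    return r * math.sqrt(n - 2) / math.sqrt(1.0 - r * r)


N = 42                  # assumed number of subjects behind each coefficient
CRITICAL_T_05 = 2.021   # two-tailed .05 level, 40 df, from a standard t table

reported = {
    "improvement-decrement, Muench": 0.127,
    "improvement-decrement, Haimowitz": 0.048,
    "improvement-decrement, Carr": 0.006,
    "normalcy gain, Carr": 0.260,
    "normalcy gain, Muench": 0.123,
    "normalcy gain, Haimowitz": 0.035,
}

t_values = {name: t_for_r(r, N) for name, r in reported.items()}
significant = [name for name, t in t_values.items() if abs(t) >= CRITICAL_T_05]

for name, t in t_values.items():
    print(f"{name}: t = {t:.2f}")
```

On these assumptions none of the six coefficients reaches the .05 level, the largest (Carr's normalcy gain, r of .260) giving a t of about 1.70.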

The mean differences between group pre- and post-therapy Rorschachs for the
number of normalcy gain scores showed no significant results. On the improvement-decrement
scores, only Haimowitz’s scores gave a significant mean difference at the
1 per cent level of confidence. Haimowitz’s scores showed the greatest change toward
improvement with both normalcy gain and improvement-decrement scores.

²The author wishes to express his sincere appreciation to the nine members of the Psychotherapy
Research Group for permission to use the data which they collected.


CONCLUSION 
The hypothesis that Rorschach signs of adjustment which incorporate greater 
qualitative specification will correlate to a higher degree with an independent meas- 
ure of success in therapy than will the more quantitative methods, is not supported 
by the results of this study.

NORMS FOR THE MINNESOTA MULTIPHASIC PERSONALITY
INVENTORY WITH STUDENT NURSES 


CHARLES A. WEISGERBER, S. J. 
University of Detroit 


PROBLEM 

The need of special norms for college groups on the Minnesota Multiphasic 
Personality Inventory has been indicated by Brown.“ Lough®? and the present 
writer“) have found that groups of student nurses obtain appreciably higher means 
on some of the scales than the normative population. Although it is possible to take 
this fact as an indication that nurses verge a little toward abnormality, a more likely 
interpretation is that the general norms are not perfectly applicable. Hence the writ- 
er has prepared the accompanying set of T scores. 


PROCEDURE 

The normative group consisted of 168 girls enrolled in a three-year nursing cur- 
riculum at two moderately large hospitals affiliated with Loyola University, Chicago. 
St. Elizabeth Hospital, Chicago, supplied the protocols of 39 freshmen; St. Francis 
Hospital, Evanston, Illinois, supplied the protocols of 58 freshmen, 36 juniors, and 
35 seniors. The mean age of the group was 19-1; the range, 17 to 26. The group form 
of the test was used; it was administered shortly after the beginning of the school 
year. 

RESULTS 

The T scores, which were computed according to the usual formula, are pre- 
sented in Table 1. Norms for freshmen were also prepared, but they are omitted 
here because they differ very little from those for the entire group. The reason for 
the similarity can be seen in the close equivalence of means and standard deviations 
reported in Table 2.

TABLE 1. NORMS ON THE MINNESOTA MULTIPHASIC PERSONALITY INVENTORY FOR
STUDENT NURSES (N = 168)

T Scores
Hs*   D   Hy   Pd*   Mf

*On these scales the K correction has been added to the raw score.

TABLE 2. RAW SCORE MEANS AND STANDARD DEVIATIONS OF THE
FRESHMEN AND OF ENTIRE GROUP

              Freshmen            Entire Group
Scale       Mean      SD        Mean      SD
Hs*        12.361    3.        12.607    3.254
D          18.227    3.        18.530    3.699
Hy         19.247    4.        19.631    4.359
Pd*        20.742    3.        20.821    3.642
Mf         33.794    4.730     34.101    4.603
Pa          8.299               8.494    2.704
Pt*        27.031     .573     26.887    4.634
Sc*        25.443     .582     25.161    5.278
Ma*        20.969    4.263     21.065    3.971

*On these scales the K correction has been added to the raw scores.


These special norms have their advantages and their disadvantages. Those 
who score rather close to the mean for nurses will be represented as close to a T score 
of 50. It then will not be necessary to make a mental correction to allow for the fact 
that they are not appreciably deviant within their own group. For example, on the 
Ma scale the mean raw score for nurses is 21.06, which according to the tables in the 
MMPI manual would be equivalent to a T score of 60. Such a T score creates the 
appearance of a fairly great deviation, whereas actually it is the mean for the group 
in question. 

On the other hand, a student who deviates considerably from the nurses’ mean 
will appear to be even more deviant, except on the Mf and Ma scales. By way of 
example again, on the Hs scale a raw score of 23 would yield a T score of 70 accord- 
ing to the MMPI manual, a T score of 82 according to the norms in Table 1. The 
reason for this difference is, of course, the fact that the standard deviation of a select- 
ed group is generally somewhat less than that of a broader population. Such exagger- 
ation of extreme scores is quite pronounced on the Hs, D, and Pa scales. 
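
The two examples above can be reproduced arithmetically. The sketch below assumes that "the usual formula" is the linear T score, T = 50 + 10(X - M)/SD, and that the entire-group values 12.607 and 3.254 in Table 2 belong to the Hs scale; that assignment is an inference, though it does yield the T score of 82 quoted in the text. The function name is illustrative.

```python
def t_score(raw, mean, sd):
    """Linear T score: mean 50, standard deviation 10 in the normative group."""
    return 50.0 + 10.0 * (raw - mean) / sd


# Entire-group raw score means and standard deviations (N = 168), from Table 2.
HS_MEAN, HS_SD = 12.607, 3.254   # assumed to be the Hs row
MA_MEAN, MA_SD = 21.065, 3.971   # Ma row

# A raw Hs score of 23, evaluated against the student nurse norms.
hs_t = t_score(23, HS_MEAN, HS_SD)

# The nurses' own Ma mean, evaluated against the student nurse norms.
ma_t = t_score(MA_MEAN, MA_MEAN, MA_SD)

print(round(hs_t), round(ma_t))
```

The Hs value rounds to 82 and the Ma value is 50 by construction; the corresponding manual values of 70 and 60 come from the general norms, which are not reproduced here.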


SUMMARY 


Clinical experience has indicated the need for special norms on the MMPI for 
college students and other groups such as student nurses. This paper presents MMPI 
T scores secured from a normative group of 168 student nurses. The scores of 
freshman student nurses did not differ significantly from those of the whole group.

RECORDING THE RORSCHACH PROTOCOL
ROBERT M. ALLEN 


University of Miami 


Essentially, recording is desirable in order to capture and hold the meaningful 
nuances which may be entirely lost in the usual method of hand note taking by the 
tester. After considerable experimentation with various types of recording devices 
the author finds greatest satisfaction with the disc type of audio recorder. The disc 
can be filed in the subject’s case folder without appreciably increasing the cost of 
testing. Furthermore it assures a permanent verbatim record for scoring, interpret- 
ing, follow up study, research, and is excellent staff conference material. It also 
serves as a facile teaching medium in an introductory course in the Rorschach 
technique. The author finds this procedure valuable for appraising the performance 
of cerebral palsied patients. It frees the examiner from note taking and affords in- 
creased opportunities for observing behavior in the testing situation. Thus it is 
possible to see many little things not otherwise attended to in the multiplicity of 
tasks necessary for obtaining an adequate protocol. 

The writer has been experimenting with the possibilities of recording Rorschach 
protocols for the past year. This procedure is not new in counseling research and it is 
being used successfully with the TAT. However, the Rorschach test does not lend 
itself to complete machine recording because of the Inquiry. This phase of the test 
calls for verbatim repetition of the testee’s free associations prior to inquiry into the 
location and determinant(s) for each percept. It appears that two machines are 
needed, one to play back the free association response and the other to record the 
subsequent Inquiry data. The other alternative would appear to be a continued 
jockeying back and forth on the recording wire, tape, or disc between the free association 
percept and the place on the wire, tape, or disc where the Inquiry is to be 
recorded. The first of these methods is too expensive in that it ties up two machines 
and two spools of wire or tape, or two discs. The second alternative results in a messy 
record and in frequent wire and tape breakage. 

A third method emerged from repeated trials with the disc type of audio recording 
machine. This procedure produced a well-recorded protocol with the least 
amount of difficulty and disturbance in the testing atmosphere.¹ The optimal procedure 
is as follows: After the usual rapport talk, the testee’s permission for recording 
his responses is requested. A brief explanation may be necessary in some instances, 
but the tester will experience little difficulty in securing the desired consent. 
The subject is given the Rorschach test directions“: PP: 2°?) and the free associations 
are written down by the tester. Upon completion of the free associations to all ten 
plates the testee is then introduced to the Inquiry phase“: PP. 85), He is also in- 
formed that the machine recording starts at this point and will continue to the end 
of the testing period. The subsequent verbal give and take between the testee and 
the tester is machine recorded.² The tester then reads from his written notes the 


¹It was noticed that some subjects, especially anxious neurotics, were visibly disturbed along with 
the examiner when the recording procedure was interrupted by breakages and entanglements. 

²A conference type microphone with a very sensitive pickup and a hand operated snap switch to 
start and stop the machine is best in this situation. The tester has two choices of how he will record. 
He can snap the switch “on” and let it run until the end of the complete test, or he can operate 
the snap switch so as to record only when actual conversation is going on. In the former instance the 
playback time will be the same as the original recording time with large gaps of silence. This may re- 
quire more than one one-hour disc. In the latter case, starting and stopping the machine, playback 
time is shorter and an entire Rorschach test may be completed on one side of a one-hour disc or at 
most both sides may be required. The major difficulty here is to anticipate when the testee will begin 
to speak and complete verbalizing his ideas. This may be a bit more disturbing to the neurotic patient. 
With experience, however, a modification of these two extremes may be possible. Turning the disc or 
changing discs takes very little time and may be accomplished smoothly by engaging the subject in 
trivial conversation while making the change. For playback a foot-type switch is most convenient 
since most are equipped with a device for starting and stopping the machine and also a gadget for 
positioning the disc to play at any desired point. 
subject’s first response to plate I but prefaces this repetition with the statement, 
“On card I your first response was....” As the testee indicates the location of the 
percept it is recorded directly on the Location Chart in the usual manner. It is 
important that the tester preface his inquiry into each percept with identifying 
numbers: the plate number is spoken when each plate is placed in front of the sub- 
ject and the response number for each percept in the particular plate is verbalized. 
Thus, the tester says: “On card IV your first response was....” The next response 
to the same plate is prefaced with, “The second response was....” Another requisite 
is to mark off the location for each response in each card and to identify each 
with the same response-number as spoken into the microphone. In this way the 
tester will recognize the Inquiry (location, determinants, and content) data for each 
response when playing back the disc with the Location Chart in front of him. For 
those additional new responses given in the Inquiry the tester makes the usual 
inquiry and identifies the percept on the recorder and on the Location Chart with 
“1a.” A second additional response is labeled “2a.” 
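
The spoken identifiers described above amount to a simple indexing scheme keyed by plate number and response number. A minimal sketch follows; the function names `preface` and `additional_label` are hypothetical, and the ordinal wording beyond the two phrases quoted in the text is an assumption.

```python
ROMAN = ["I", "II", "III", "IV", "V", "VI", "VII", "VIII", "IX", "X"]
ORDINALS = ["first", "second", "third", "fourth", "fifth",
            "sixth", "seventh", "eighth", "ninth", "tenth"]


def preface(plate: int, response: int) -> str:
    """Phrase spoken into the microphone before repeating a free association."""
    if not 1 <= plate <= len(ROMAN):
        raise ValueError("the Rorschach series has ten plates")
    if not 1 <= response <= len(ORDINALS):
        raise ValueError("ordinal wording is only listed for ten responses")
    if response == 1:
        # The plate number is spoken when the plate is first presented.
        return f"On card {ROMAN[plate - 1]} your first response was..."
    return f"The {ORDINALS[response - 1]} response was..."


def additional_label(k: int) -> str:
    """Label for the k-th new response that first appears during the Inquiry."""
    if k < 1:
        raise ValueError("additional responses are numbered from 1")
    return f"{k}a"


print(preface(4, 1))
print(preface(4, 2))
print(additional_label(1), additional_label(2))
```

Writing the same label on the Location Chart lets the examiner match each recorded passage to its marked location on playback.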

Testing-the-Limits is similarly recorded. After arranging the plates as required 
by this phase of the test “> »P- **!) the tester asks the questions to be answered by 
the testee’s choice of percepts. The tester speaks into the microphone the plate 
number and the location of the percept after Beck®) or Hertz“. This should be 
supplemented by noting the location on the Location Chart. 

Further uses of this technique include recording clinical impressions and test 
behavior observations. The final interpretation may then be added to the record to 
await convenient transcription. This method does not eliminate note taking but 
confines it to the free association stage only. It does enable the examiner to pay 
more attention to the other more important facets of the interpersonal and social 
situation labeled ‘‘test administration.” 
EDITORIAL OPINION 





ETHICAL RESPONSIBILITIES OF PSYCHIATRISTS 


In the conflict of interests of clinical psychologists and psychiatrists, the psychia- 
trist has traditionally held a position of dominance based on the support of organized 
medicine, more intensive clinical training, longer established status in the field, 
control of clinical facilities, and the near monopoly on the healing arts granted by 
law. In all areas of specialty practice, the medical profession has zealously defended 
its intrenched interests and has tended to place other professions on the defensive 
by mounting an aggressive attack upon any groups which attempt to invade the 
ever more broadly defined field of medicine. In this situation there has been some 
tendency for organized medicine to impugn the motives and ethical standards of 
clinical psychologists (as well as other specialists) who seek to invade the clinical 
field under auspices independent of medical supervision and control. The American 
Psychological Association has accepted its responsibilities for the regulation of 
clinical practice by preparing a code of Ethical Standards for Psychologists which 
establishes a very high standard for professional conduct second to none of any other 
professional code. The medical profession, and psychiatry in particular, may feel 
reassured that professional psychologists will conduct themselves with irreproach- 
able ethical standards. 

The purpose of this editorial is to examine certain ethical responsibilities of 
psychiatrists toward clinical psychology which have not been given wide attention. 
The first general responsibility of psychiatry is toward clinical psychology viewed as 
its basic science. In contrast with other medical specialties, psychiatry is not founded 
currently upon any universally recognized and comprehensive basic science. Some 
attention is given to basic neurology, abnormal psychology, psychopathology and 
theoretical psycho-analysis but even toward these subdivisions there is no uniformly 
accepted weighting of curricula. There is even greater variation in attention given 
to basic science psychology of which some psychiatrists appear to be ignorant and 
uninterested. While some psychiatric training facilities have integrated clinical 
psychology into their program organically, and many others give lip service by 
having a clinical psychologist attached to the staff, the general situation is that 
psychiatry has not given proper emphasis to its basic science of clinical psychology. 
This is an ethical responsibility. 

A second ethical responsibility rests upon psychiatrists not to attempt to ad- 
minister or interpret tests and methods taken over from the field of clinical psycho- 
logy in which they have not had adequate instruction and are not personally com- 
petent. There are many instances of psychiatrists attempting to administer brief 
intelligence tests, Rorschach and other projective tests, and otherwise encroaching 
upon the field of competency of the clinical psychologist. 

The third ethical responsibility stems from the second and deals with the obliga- 
tion of the psychiatrist to make available and to effectively utilize the special ser- 
vices offered by clinical psychologists. The child guidance movement has long 
emphasized the importance of the team approach, and has made available teams of 
psychiatrists, clinical psychologists and psychiatric social workers to operate each 
within their own particular competencies. Mental hospitals have lagged behind in 
developing clinical psychology departments, and in utilizing the psychologist for 
handling many types of problems which do not require medical training. And 
psychiatrists in private practice, with few exceptions, have failed to avail themselves 
of the services of clinical psychologists in group practice. In fact, psychiatrists have 
learned to use psychiatric social work more effectively than they have clinical 
psychology. 

Finally, there is an ethical responsibility of psychiatry to accept genuinely the 
integrity of other professions. If for no other reason than the research ability characteristic 
of competent psychologists, psychiatry cannot afford to overlook the 
theoretical and practical significance of the contribution which clinical psychology 
can make. It should not be overlooked that in the field of interprofessional relation- 
ships as well as in child-raising, the main objective is to bring out the best rather 
than the worst. By emphasizing the potential contribution of clinical psychologists 
rather than harping on the misbehavior of a few charlatans, psychiatry and the 
medical profession can do much to bring clinical psychology to a position of respect 
and genuine achievement. 


F. C. T. 


BASIC ISSUES IN CLINICAL PSYCHOLOGY 
I. MATURATION versus TRAINING? 


Important practical decisions currently hinge in the fields of education and 
psychotherapy on the theoretical question of whether to place major dependence on 
maturation or formal training in seeking to promote healthy personality develop- 
ment. On one hand there are the theoretical proponents of maturation who postulate 
that maximum permissiveness and nondirectivism should be utilized in allowing each 
person to work out his own life and solve problems in his own way. It is hypothesized 
that in every personality there are unrealized “growth resources” which, if unblocked 
from emotional complexes and conflicts, will become potentiated with the person 
reaching a healthy stage of self-understanding and creative fulfillment. It is argued 
that each person knows best what is best for him, and that interference from out- 
siders will only inhibit self-expression and growth. It is expected that each person 
will experience plateaus or regressions of growth, but that passive handling offers 
the best chances for preserving mental health until personality growth gets under 
way again. Maturation theory stresses that learning will not take place until a state 
of “readiness” exists, and that it may be damaging to put pressure on a person until 
he is ready to undertake action himself. 

Stemming from Behavioristic emphasis upon the importance of environmental 
stimulation of learning processes is the other theoretical position which insists that 
only through comprehensive and more or less formal training can the highest develop- 
ment of personality occur. Learning theory would admit that neurological matura- 
tion is a prerequisite for learning but would also insist that the highest achievement 
is potentiated only through training. This viewpoint would insist that the untutored 
person is deficient in the knowledge and experience which would make it possible 
for him to make intelligent decisions concerning what is best for him. It would insist 
that each young person should be exposed to the widest possible variety of training 
experiences so as to sample many areas not otherwise available to him. Unfortunate- 
ly, insufficient experimental evidence exists in support of either extreme position so 
that current practical decisions must be based on empirical judgment. 

That the issue is not academic will be known to every parent who faces many 
decisions such as whether to ignore bad conduct on the basis that it represents only 
a phase which the child will outgrow, or will he? Should a child be allowed to select 
his own courses in school, and to budget time spent in each? In the care of juvenile 
delinquents, should we spare the rod in the hope that showering the transgressor 
with love will help him mature beyond such behavior, or should he be held account- 
able for his actions and punished in some way? In graduate education, for example 
in medicine, should we follow the European system of allowing each student to con- 
centrate on courses in his specialty or the American system which insists that every 
student be proficient in every specialty? Should young people be allowed to explore 
sexuality freely or should formal education and supervision be provided? Will an 
emotionally disturbed person be able to stabilize himself through his growth re- 
sources if he is just handled permissively and nondirectively? Or on international 
levels, is it best to be tolerant of the transgressor in the hope that he will learn the 
error in his ways or should we oppose force and engage in intensive psychological 
warfare for the control of men’s minds? 

F. C. T. 
in its magnitude and adequacy that it will undoubtedly become the established refer- 

the authors had restrained themselves from making moralistic interpretations of 
their data with reference to the evolution of social attitudes towards sex. While this 

studies to their logical conclusions of the discovery of oblique and second-order factors 
rather than being left at the level of rotated orthogonal structure. He believes 
that three main dimensions of personality have been established: Neuroticism, Extra- 
version-Introversion and Psychoticism. These dimensions are relatively orthogonal 
to each other and also to “g’’. In his somewhat one-sided concentration on the search 
for the highest generalization in factorial analysis, one has the impression that he 

Dr. Stephenson, Professor of Psychology at the University of Chicago, here presents 
detailed discussions of Q-technique and its methodology. Q-technique is the 
factorial method for analyzing the correlations between persons, and it can be 
used to objectify the specificities which discriminate between persons. In contrast 
to Eysenck with his emphasis on generalization, the emphasis here is on specificity, 
the study of the factors which differentiate between men. The Q-sort method offers 
much promise for the future, and it is to be hoped that various Q-sorts will be quickly 
published and standardized. 

York: Ronald Press, 1953. Pp. 885. 

ume 1 includes a general discussion of the relation of psychology and medicine (Part 
1), some aspects of psychology (Part 2), and a comprehensive coverage of psycho- 
somatic relationships (Part 3). Volume 2 discusses the principal psychodiagnostic 
methods and their applications in medical practice. 


Doll, Edgar A. Measurement of Social Competence. Minneapolis, Minn.: Educational 
Test Bureau, 1953. Pp. 664. 


Dr. Doll is a recognized authority in the fields of mental deficiency and the 
measurement of social competence which he defines as the functional ability for exer- 
cising personal independence and social responsibility. This book presents in great 
detail data concerning the standardization of the Vineland Social Maturity Scale. 
Part one discusses the philosophy of the method; part two outlines the construction 
of the scale and item specification; part three presents administration; part four dis- 
cusses normative standardization, item validation on feebleminded subjects, and 
scale validation. A concluding section discusses applications of the concept of social 
maturity measurement. 

versities Press, 1953, Pp. 212. $3.00. 


Introduced with a foreword by Dr. Greenacre, this monograph presents essays by 
Edward Bibring on “The Mechanism of Depression,” Edith Jacobson on “A Contribution 
to the Metapsychology of Cyclothymic Depression,” Elizabeth R. Zetzel 
on “The Depressive Position,” George Gero on “An Equivalent of Depression: 
Anorexia,” and Maurits Katan on “Mania and the Pleasure Principle: Primary and 
Secondary Symptoms.” All papers are psychoanalytically oriented. 


Cavanagh, John R. and McGoldrick, James B. Fundamental Psychiatry. Milwaukee, 
Wis.: Bruce, 1953. Pp. 582. $5.50. 


The authors are associated with Catholic University and Seattle University res- 
pectively and they have written a psychiatry text which is oriented upon the basis of 
Catholic philosophy. While the general classifications and symptom descriptions of 
scientific psychiatry are accepted, interpretation is oriented toward dualistic philoso- 
phy and revelations concerning Man’s Nature and Goals, e.g. God is man’s goal, and 
conformity to God’s will excludes psychic depression, conflicts, fear, anxiety and 
inferiority feelings. 


Bonaparte, Marie. Female Sexuality. New York: International Universities Press, 
1953. Pp. 225. $4.50. 


A summary of psychoanalytic theory relating to female sexuality. 

159. $3.00. Some dialectic essays on dream life. 

A digest of the late Harry Stack Sullivan’s concepts of socialization and acculturation, 
oriented from his viewpoint that psychiatry is the “science of living under 
the conditions which prevail in a given social order.” 


Bosselman, Beulah C. The Troubled Mind. New York: Ronald Press, 1953. Pp. 

A new TAT for African subjects consisting of 21 pictures and one blank card 
with drawings of African natives with suitable clothing and backgrounds. The 
manual describes test administration but with no standardization data. 